PSYCHOLOGICAL CLINIC FROM NEW JERSEY


It gives me much pleasure to introduce a series of papers representative of cur- 
rent clinical research in Community, State, and Federal institutions and educational 
centers in New Jersey. An idea suggested by my colleague, Bernard S. Aaronson,
sparked this collection. Clinical psychological activities in the New Jersey State
Department of Institutions formally began in 1919 when J. E. Wallace Wallin left 
the Vineland Training School to establish a psychological laboratory at the New 
Jersey Village for Epileptics, the site now occupied by the New Jersey Neuro- 
Psychiatric Institute. Much has happened in the dynamic fifty years that followed, 
as recorded in the special 1960 commemorative issue of the New Jersey Welfare 
Reporter, a copy of which is available on request to my office. 


The first four papers in this series relate clinical case material, reflecting the 
broad scope of New Jersey activities and close ties among State and partially State 
supported agencies and Federal institutions. Frances Seidman reports on long-term
out-patient therapy with an autistic youngster in a community child guidance center. 
The importance of family dynamics and the need to develop appropriate psychological
techniques of assessment are clearly apparent from the Guerneys’ study of a disturbed
young girl and her parents in the Rutgers University Psychological Clinic.
Next, Erwin Friedman shows what can be done in individual therapy with a “defective
delinquent”, rejected by his family and relegated by society to a State colony.
Rebecca Snyder completes the cycle with a diagnostic study of an adult veteran, 
initially diagnosed paranoid schizophrenic, whose psychosis was eventually estab- 
lished to be on a metabolic basis. 


The next group of five papers focuses on techniques and problems of evaluation. 
J. C. Brengelmann and Joseph T. Kenny note the wide intelligence quotient varia- 
tions in retardates as reported by different tests, and provide experimental evidence 
indicating that the higher the intelligence, the higher the Performance IQ, and the lower the
intelligence, the higher the Verbal IQ. Eileen Lanfeld and David Saunders consider anxiety
in terms of an “uncertainty effect”, and show how a sentence arrangement test can be
developed to measure what is currently obtained from the WAIS Object Arrangement
Subtest. Harry Stein explores Rorschach reliability and offers several suggestions
for its measurement. The MMPI is used by Bernard S. Aaronson and Howard
Grumpelt to substantiate the clinical observation that some male homosexuals have 
a primarily masculine orientation and that masculinity-femininity is not a unitary 
dimension. Finally, C. M. Franks, E. A. Holden, and M. Phillips demonstrate the 
possible dangers of using self-rating personality questionnaires as measures of ob- 
jective behavior without more extensive consideration of the population under 
observation. 


Henry P. David, GUEST EDITOR
New Jersey State Department of Institutions and Agencies 





OUTPATIENT TREATMENT OF A SERIOUSLY DISTURBED CHILD 
IN A CHILD GUIDANCE CENTER 


FRANCES SEIDMAN 


Child Guidance Center of Mercer County*


DIAGNOSTIC EVALUATION

Leslie was four when she first came to the Center. She was large for her age, 
sad-eyed and dull, and she left her mother with neither a backward nor a forward 
glance. Leslie had been referred by the pediatrician because of hyperactivity and 
possible mental retardation, and she gave the immediate impression of acute emo- 
tional disturbance. Her general functioning was typified by an unrelatedness and a 
fear of being close to anyone. In the playroom she was not hyperactive; but she was 
not able to concentrate on any one activity. Her speech was infantile and suggestive 
of the schizophrenic process. She used third person pronouns in referring to herself 
and others in the room, and used echolalia and clang associations. In some respects, 
Leslie appeared to meet Bender’s description of the schizophrenic child who is 
pseudo-defective and autistic. There were more hopeful signs, however, in her 
normal motor development, her low average social maturity as measured by the 
Vineland Social Maturity Scale, and in her ability to make some object relations. 
For example, while she did not directly recognize the psychiatrist’s presence she 
placed her coat on his and said, “on his coat”. In the psychological evaluation Leslie
was able to do some of the picture vocabulary and several non-verbal tasks at about 
the three-year level. She scribbled when asked to draw a man and then drew a large 
black spider. She kept asking the psychologist to draw spiders, wasps, and webs 
for her. 

Leslie’s mother told of a wanted child and an easy pregnancy. She said, however,
that shortly after the birth she reacted with intense fear to the doctor’s comment,
“I think she’s okay”. She could hear only the word “think” and she was sure
that something was wrong. She recalled her own tension and inability to hold the
child. Leslie “felt like a board” when she tried to feed her. It was with relief that she
allowed her sister, who was a registered nurse, and other members of her large family 
to take over much of the responsibility for Leslie. When Leslie was five weeks old
she was hospitalized for a six-day period for projectile vomiting and diarrhea. She 
went home with her aunt rather than her mother. She was both weaned and toilet 
trained by relatives on some of her frequent visits away from home. She was handed 
like a doll from person to person, “spoiled” and “loved” by everyone. Leslie had
three adult half-siblings, children of her father, living at home. She was never fed 
by the same person twice in a row and “never knew what bed she was going to wake
up in”. At about a year, Leslie was reported to have spoken single words and to be a
happy baby. Then at fifteen months, she stopped talking. She was thought to be 
deaf since no one could “get to her”. The withdrawal was accompanied by hyperactivity
and restlessness. Medical check-ups, including the EEG, revealed no
physical problem. When Leslie was three, her brother Roy was born. At four she 
had a mild case of mumps, her only physical illness. 

An attempt to evolve a suitable treatment plan for a child like Leslie raises 
questions that cannot be answered with certainty. How severe is the disturbance? 
What is the treatment of choice? How long will it take? What does the future hold? 
Despite current theories about the preference for residential treatment for the 
severely disturbed child, the diagnostic team at the Center proposed a plan for outpatient
therapy. This was done with full knowledge that such a plan would require
more treatment hours over a longer period of time than is accorded the average
treatment case. It was speculated that Leslie would need help, perhaps indefinitely,
and that it would be well worth the expenditure of Center time in order to study the
value of such a continuous program. Prognostically, Leslie was seen as having fair
chances for making gains because of the cues for relating that were noted in the
evaluation. A summary of the on-going treatment program is presented at this
point in order to try to understand how and why it appears to be helping Leslie. The
focus of this paper is on the child, who has had 131 hourly treatment contacts over a
four-year period. During the first year she was seen regularly, once and then twice
a week. Later, sessions were spaced and for the past year she has been seen about
once a month. The concurrent social work sessions for the mother are not being
studied. The father was interviewed twice in the intake process.¹


*The Child Guidance Center of Mercer County at Trenton, N. J., is an A. A. P. C. C. approved
Clinic that uses the characteristic child guidance team approach. Following a diagnostic period and
family conference, the parents are usually seen for weekly therapeutic interviews by a psychiatric
social worker. The child has weekly therapy sessions with a child psychiatrist or a child psychologist.
Appreciation is expressed to Charles R. Swift, M. D., Director of the Center, who supervised the
therapy and took ultimate responsibility for the total treatment program.


TREATMENT 
Treatment for Leslie began when she first came to the Center. When the 
therapist met Leslie and her mother in the waiting room, mother reported that 
Leslie “could hardly wait” to play with the doctor.²


T-1. She says my hand is very little and puts her hand in mine; she repeats in a far-away
voice, “little hand Dr. Seidman, little hand.”


Anxiety over body was to be a constant theme in treatment. One of the goals 
that evolved early was an attempt to help Leslie to develop a sense of self—to know 
her own body boundaries without needing to withdraw completely. Here, as in other 
areas of work with her, the way was shown by the child. She gave the cues, and the 
therapist tried to meet her needs. For example, Leslie frequently asked the therapist 
to draw for her, as she did at the time of evaluation. The therapist offered to draw
her instead of a spider, and drawing Leslie became a regular part of treatment.

T-3. She asks me to paint Leslie, and I do so, naming the body parts as I paint. She smiles
happily and wants me to paint the sore on her arm. She asks, “Who is Leslie?” and I point to
her. She echoes this and then points to me and identifies me. She goes back to painting
and herself makes a “little Leslie.” This is a copy of mine and she seems confident as she
makes and names her hair, neck, etc.

T-8. Then she asks me to paint one of Leslie and I do, reciting the little story that goes with it.
Then comes the real surprise. She says she will make one of Dr. Seidman and she begins
to make a painting of me.

T-12. Today for almost the first time she looks directly in my face and asks what color my
eyes are.

T-13. She says, “I want to wear the blue dress in this school that looks cute.” She looks closely
at me and questions the color of my hair, as well as noting I’m wearing earrings. She says,
“What color is your hair Dr. Seidman?”


Almost as if now that she knew who she was and who the therapist was, she 
began to show clearer signs of identifying with the therapist. 
T-26. She comments on her own clothes. She doesn’t want to wear buttons because Dr. Seid- 
man isn’t wearing any buttons. She wants lipstick like Dr. Seidman. 


Sometimes the cues were not as clear, and Leslie’s needs could only be inferred. 
When Leslie seemed content to be alone almost for hours on end, the therapist in- 
sisted on doing things for her and doing things with her. Sometimes, however, when 
the therapist tried to shift the emphasis from some compulsive activity to something 
that seemed more “healthy”, Leslie refused to participate, again showing the way,
or at least setting the pace. 

T-23. I try to lure Leslie into a ball rolling game; but this does not have meaning for her, and she 
insists on going back to the clay. 


¹Leslie’s father was killed in an automobile accident about five months after treatment began.
²Movement in therapy will be described from excerpted notes taken from the case record.

Symbolic regression was encouraged by the therapist in response to Leslie’s
revealing of real basic need for nurturance in the first session.

T-1. I introduce the rubber cow family, and Leslie identifies the mommy and baby. She is concerned
over the milk—“mommy has milk. Daddy milk”. . . Leslie drinks from the cow’s
teat. I recognize the hunger and show her the baby bottle that has “milk” in it too. She
drinks a mouthful but goes back to preoccupation with the clay mouse. She sucks the tail
and says it tastes good.


During the second session Leslie was coaxed onto the therapist’s lap with an 
offer of hard candy to suck. Gains in her ability to use body contact were made very 
slowly. Leslie fought contact; then she seemed to enjoy it. Being a baby and “playing
baby” was to be a steady theme for a long time. At first she identified herself
with her younger brother and later was able to enjoy the regressive play as herself. 

T-4. She comes to me for candy and climbs on my lap. I feed her candy, and she slips off
quickly. Then she accidentally drops a piece, slides off, gets it and crawls back . . . There is no
more candy, and I offer her a drink out of the bottle. She snuggles on my lap and drinks it.
“Be your little Roy.”


T-6. Leslie asks for candy and sits on my lap in a resisting way - she scrambles off as quick- 
ly as she can. 


Leslie’s decision to enjoy the contact came slowly. 

T-30. With only a little bit of help, she becomes a baby. She gets onto my lap and demands to 
be fed with bottle and “baby candy” . . . Leslie is bathed, diapered, burped, and fed over and
over again . . . This is the session I have been waiting for. This type of regression is inevitable
and exactly what Leslie needs. If she can get some nurturance out of this experience, it is 
almost possible that she might be willing to give up autism as a defense. 

T-37. She spends a good part of the time sitting on my lap, sucking on the bottle and just being
cuddled . . . There is much less preoccupation with the symbols of being a baby, toileting,
etc.


A plan to involve mother more closely in Leslie’s therapy grew out of mother’s 
expressed concern over Leslie’s playing baby, and the hope that more warmth and 
relatedness could be kindled between mother and child. It was to be almost a re- 
teaching or a re-living process for them both. Therapy was planned for twice-a-week 
with the mother’s staying in the playroom on alternate sessions. 

T-31. Her mother seems capable only of staying remote and then of asking Leslie to identify the
animals. She responds quickly to my suggestion that she play real mommy with Leslie and
take her on her lap and give her the bottle.


T-35. Leslie wants to be fed almost immediately and I suggest she go to her mother and her
mother will give her candy.


During sessions with the mother, Leslie again began to react with withdrawal 
and to save meaningful material for the alternate sessions. 


T-48. The sessions are more like nursery school. 


Once a week sessions were resumed and Leslie was more free in the playroom. 
Her experiences with her mother were useful, however, since mother and child now 
sat close to each other in the waiting room and mother reported that Leslie was being 
more affectionate at home and that she herself could give love more openly to Leslie 
and Roy. 

Growing up continued in the treatment sessions as well as at home. 


T-102. As Leslie chews a piece of candy, she says, “The baby is cutting teeth”. Giving up the
comfort of being a baby is not easy. Leslie says, “Tell the little baby a story.” and, “Sit on
the baby’s bed.” At the end of the session, Leslie commands, “Teach me to walk, Dr. Seidman.”


T-131. Leslie sits on my lap for just a moment. “Remember when I played baby?”


Leslie suffered severe trauma early in the therapy period, but she was able to
use the help of the therapist. Following the accidental death of her father, Leslie
was brought to the Center by her aunt. Mother was described as being in constant
mourning and unable to answer Leslie when she called.

T-18. Leslie keeps repeating in an anxious tone “Mommy wants to bring you” and I said that
mommy did want to, but that she was sad because of daddy. Leslie asks, “Where’s Leslie’s
daddy?” Leslie wants to make a painting and begins to paint black splotches on the paper.
“Pretty black.” As she paints she says, “Mommy wants to take me this time. She’s sad and
sick. She wants to come, she’s sad.” Just before the time to leave, Leslie says, “Let’s take
my hand and go downstairs.”

T-19. Leslie is extremely regressed—she is pre-occupied with clay and makes worms most of 
the session. One time she asks, ‘‘Where’s my daddy? He’s coming home this afternoon.” 


T-21. She makes worms of clay. “Worms can’t talk . . . Where is my daddy—daddy is dead.”


It was months before Leslie expressed her sorrow in an appropriate way. The 
therapist felt that Leslie’s first tears presented another opportunity to bring mother 
and child closer together. 

T-46. Leslie started to cry convulsively. After some preliminary comforting, I took her to her
mother who showed her pictures in a favorite story book and pointed to pictures as Leslie,
Roy, etc. Then she pointed to a policeman and Leslie identified it as “Daddy—he’s in
heaven.” Back in the playroom she sobbed, “I want my daddy.”


The birth of a child to one of her older sisters gave Leslie an opportunity to 
express feelings of jealousy displaced from her brother. 


T-42. Leslie spends the session sucking on the tiny bottle and bathing the baby. She tells me 
she mustn’t hit the baby’s face hard.


Anal and masturbation problems and castration anxiety were brought out. 

T-27. Leslie begins again to play with clay candles and electricity but soon gets into material 
about “peeing” and ‘‘poo-poo’’. She tells me she touches her poo-poo and then with the 
rubber genital dolls plays out mother’s spanking girl—girl’s spanking mother—everybody’s 
being bathed, washed, etc. She says touching poo-poo makes hands ugly and I reassure her 
that the hands are the same. 

T-29. Leslie keeps insisting that girls have penises and over and over again touches them and 
asks about them as she dresses and undresses the dolls. She keeps confusing herself with the 
dolls, tying in the boy with Roy and the girl with herself, saying yes she has a vagina—no, 
she has a penis—yes, she has a vagina, etc.


Self-mutilation and punishment was a repetitive theme that Leslie expressed 
through obsessive play. The therapist recognized her fears of destruction and 
offered her both understanding and protection. 

T-23. She makes a Christmas tree of clay and adds candles—she keeps repeating she mustn’t 
put her finger in the flame. 

T-26. She knows what she wants to do—she uses clay to make a candle and instructs me to 
have the little girl put her hand in the flames. I say I won’t let the little girl get hurt and I 
push the doll away each time. Then she makes an electric wire and plays at having a clay 
doll touch it. . . . little girl mustn’t touch the wire . . . little girl mustn’t, etc. 


T-30. She indicates she wants to put her hands on the electric fan; I say we will not do that . . .


Leslie spent weeks in concern with candles, with electricity, and then in painting 
and making jails. Throughout, obsessive interests in abstractions and in spatial and
temporal concepts were brought out. Leslie painted moons and suns and talked of
white, cold snow and of outer space. After the marriage of one of her sisters (about 
T-100), Leslie drew wedding cakes and brides for several weeks. At the same time, 
despite the repetitive interests, there were continued gains in relationships both in 
the Center and within the family. “Did you miss you? . . . Is mommy afraid you’ll
catch cold?”

Following Leslie’s entrance into kindergarten, she began to ask many questions 
that indicated she was trying to test reality and trying to get help in understanding 
man-made rules. She seemed to be saying, “tell me what to do”.

T-88. She draws and paints the schoolyard and school-friends. She asks why you have to stay
in line and why you have to take turns. She asks questions without waiting for the answers
but is not bizarre.


Gleaned from sessions 100 to 130 are the following quotes—showing Leslie’s 
growth and problems. 


“What happens if mommy calls and you don’t like to come? . . . How come nobody likes a
tattletale? How old are you when you die—is gray hair old? Why can’t we drink pee-pee?”

“I don’t want a little brother. What’s a worry? Is it a worry when mother comes late?
Why aren’t real people happy?”


Leslie is still in therapy. She has had one session following a planned four- 
month vacation, including the summer and the first marking period in school. She 
and her mother brought in her second grade report card, indicating average grades 
and no particular adjustment problems.³

T-131. When I tell Leslie I missed her because I like her, she answers slowly “I think I like you
too”. She indicates that she still has problems and wants my help in figuring them out.
“When you were a little girl did you ever have bad thoughts?” She reveals obsessional ideas
about kidnappers and says, “It’ll be hard to get it out of my head”, or “I’ll grow out of
it probably?” Leslie reassures me that she doesn’t draw kidnappers and then reveals her
mother’s fear of her expressing these thoughts either at home or at the Center. I try to show 
Leslie that her mother worries because she loves her. 

I suggest that I can tell mother that Leslie is old enough now to have secrets at the 
Center and that here she can let me know what she is afraid of .. . 


Leslie was then able to make two drawings: one of kidnappers for the therapist 
to see, and one of cats for her mother to see. This appeared to be a conscious effort 
at repression. 

Growth was summed up in various ways. By Leslie, “I told my girl friend that
she broke a promise to me and I’ll break one to her and she won’t like it. Is that
what you’d do?” “Remember when we played baby? Remember when I used to
draw jails?” And by mother, as Leslie snuggled on her lap and mother hugged her,
“You’re a big girl now and you can have secrets.”


SUMMARY 

The first four years of an outpatient treatment program for a severely disturbed
child have been described, with particular emphasis on work with the child herself.
Despite strong theoretical arguments for the more intensive and all-inclusive
program offered in a residential treatment center, an outpatient program within the 
framework of a child guidance center was attempted. Quotations from the weekly 
therapy notes reflect the gains that were made by the child. 

Two very important features of therapy that may distinguish Leslie’s case from 
that of other children seen at the Center are: (a) treatment has continued long beyond
the usual time period, and (b) the child’s therapist has remained constant during
this four-year period. It can be speculated that this latter circumstance has made
it possible for Leslie to feel secure enough to permit the closeness that allowed for 
symbolic regression, emotional feeding, and eventually the active building up of 
more useful defenses than withdrawal. 

It is important to note that Leslie was able to be regressed in the playroom and 
at the same time to meet in a limited way the demands of reality at home and in the 
classroom. Keeping Leslie at home made it possible for Leslie’s mother to make some 
superficial changes in the environment and to respond positively to changes in Leslie 
herself. The Child Guidance Center has been able to be flexible in planning for more 
or less intensity in the treatment program depending on Leslie’s needs, and is pre- 
pared to serve as consultant to the school, and to offer continued help to Leslie, in 
times of stress, as she grows. 


³Up to T-130, Leslie’s mother has had two social workers. When the second worker left the
agency it was decided that mother would continue with Leslie’s therapist.


ANALYSIS OF INTERPERSONAL RELATIONSHIPS AS AN AID TO
UNDERSTANDING FAMILY DYNAMICS: A CASE REPORT¹


INTRODUCTION


There has been an obvious long-term trend toward greater emphasis on influencing
parents as a means of facilitating treatment of disturbed children. Recently,
Ackerman(1, 2) and others(3, 4, 5, 6) have given added impetus to this trend by
emphasizing the diagnosis and treatment of the family as a dynamic entity rather 
than a collection of individuals. Traditional psychological assessment techniques, 
however, orient clinicians in the latter direction. For purposes of family diagnosis
and treatment it would be well, instead, to give greater emphasis to the following
considerations. The same conflicts in a given individual will be heightened or min- 
imized, expressed in one way rather than another, elicit one response or another from 
other persons, depending upon what the other individuals in the family are like, how 
the individual expects the others to behave, and upon how they actually do behave.
Significant change in the behavior of one family member automatically will produce 
some changes in the responses of others in the family toward him. The full under- 
standing of a given family member, therefore, depends upon understanding his inter- 
personal interactions with the other family members. Fresh theoretical orientations 
and measurement techniques seem desirable to supplement traditional ones in im- 
plementing such considerations. 

What appears to be a singularly appropriate approach is available, although 
there seems to be no rapprochement as yet between its advocates and the major
spokesmen for family diagnosis. We refer to Leary’s“) theory and techniques of in- 
terpersonal multilevel personality diagnosis. The purpose of this presentation is not 
to demonstrate what might be done by using all of the techniques, or only the 
techniques, of the interpersonal diagnostic system. It is intended, rather, to illustrate 
how the traditional sort of case analysis, presented first below, was complemented 
by a second analysis employing only a few of the non-quantitative conceptualizations
of Leary’s system, especially those having to do with “interpersonal reflexes”
and the principle of “reciprocal interpersonal relations.”


THE CASE STUDY


Presenting Problem. Helen was a nine year old white female residing in a lower middle class com- 
munity, attending the third grade in the neighborhood public school. Her father, Mr. D., was 31 
and not yet settled occupationally, having shifted jobs within the area of retailing every year or
two since returning from the service. The mother, Mrs. D., was only 26 years of age, having
married at 16. Helen’s early childhood was reported as having been stable and happy, although
she had been cared for by the maternal great-grandmother during the day, Mrs. D. having been
obliged to work outside the home. Early school adjustment was good in spite of several school
changes. In the second grade, while the family was residing in South Carolina, Helen began resisting
going to school at home and then resisting again in the classroom. She rebelled to such a
degree that the principal would no longer tolerate the disturbance she caused, and insisted that
she be enrolled in a private day school for emotionally disturbed children, where she apparently
made a good adjustment. At the same time, she was seen privately by a psychologist for about
six sessions, with no appreciable results. The services of the psychologist were sought primarily
because of the appearance of additional symptoms. Helen had developed an obsession about
death, and expressed morbid curiosity, mixed with fears, about childbirth and many aspects of
sex.


¹This case was seen as part of a Family Group Therapy Research Study being conducted by the
authors and supported by a grant from the University Research Council of Rutgers, The State
University of New Jersey.

At the time of referral to the Rutgers Clinic, Helen had recently returned to New Jersey and 
re-entered a public school. She was repeating her behavior of the previous year, pitifully clinging 
to her mother and sobbing, exhibiting violent displays of temper including biting, striking, re- 
fusal to move, abusive language, screaming, and last minute illnesses to prevent herself from being 
sent to school. Consultation with the school psychologist revealed that there was apparently little 
concrete about the school situation itself to account for this behavior. When she was settled in the 
classroom for the day, Helen’s adjustment and academic performance were good. The parents 
reported that Helen had become completely unmanageable at home, refusing to abide by any 
limits, and preventing (by means of the behavior described above) the parents from going out to- 
gether without her. In addition, fears of death, growing up, and sex were expressed with ever- 
increasing frequency. 


Test Behavior. The test battery consisted of the WISC, Goodenough, Bender-Gestalt, Michigan 
Picture Test, and puppet play observation. The parents were interviewed, and each filled out an 
Interpersonal Check List for self, spouse, and child. Both parents appeared intelligent, spoke 
freely and openly about the problem, and seemed well-motivated to do something about it. Mrs. 
D. was the more verbal of the two, particularly with regard to describing her daughter’s “‘faults’’. 
Mr. D. tended to tone down his wife’s statements and expressed himself with more control. 

Helen, herself, was a thin, wiry child with an intense, nervous manner and a flippant, self- 
conscious way of responding to attention. She related easily, and quickly expressed feelings of 
love and possessiveness toward the examiner, primarily through notes that she wrote. Just as 
quickly, she expressed irritation and impatience with the tests or the examiner when the tests be- 
came difficult or conformance was demanded. There was extreme testing of limits along with 
efforts to control the situation with threats, pouting, and withdrawal of affection. In general, 
however, it was obvious that Helen wished to succeed, enjoyed doing so, and was self-conscious 
about failure. 


Summary of Traditional Evaluation. Helen was of average intelligence and showed no signs of 
neurological impairment. Dependency gratifications seemed to have been minimal. Infantile 
narcissistic needs still exerted a strong influence on Helen’s perceptions and behavior. Aggressive 
impulses were strong, particularly toward her mother, whom Helen regarded as rejecting. Con- 
flict between a great need for dependency gratification from the mother and the hostility resulted 
in marked ambivalence toward women, inadequate feminine identification, and sexual role con- 
fusion. Competition with her mother, based on strong Oedipal conflict, made the dependency- 
hostility conflict still harder for Helen to resolve. Father figures were seen as ineffective, but not 
rejecting. 

Helen's ego was weak, and she was quite unsuccessful in gaining adequate satisfaction for her 
needs. Superego development was uneven, and emotional controls were poor. Impulses were apt 
to break through in extreme and inappropriate ways. Reality testing was not, however, seriously 
deficient. Helen’s self concept was not so much negative as it was tentative and ambivalent. She 
vacillated between feeling worthwhile and adequate, even powerful, and very unloved and in- 
adequate. Naturally, a great deal of anxiety was generated by these conflicts and uncertainties. 
Helen's major defenses against these anxieties seemed to be denial and displacement. 

The presenting problem seemed to be explained by these findings. The overdependence, 
combined with hostility toward the mother, prompted her inability to attend school without 
rebellion. This was a perfect vehicle for satisfying the two opposing drives. The sex problem arose 
from her confusion regarding her sexual role, and the fears of death and separation again from the 
conflict of hostility versus dependence. Fears of growing up seemed to arise from all of these 
dynamics working together to make the future most insecure and forbidding in prospect. Her 
strengths were her adequate intelligence, the existence of some positive self percepts, and, most 
important, the fact that she continued to desire to win affection from others and continued to 
strive, albeit in ineffective ways, toward this end. Since the roots of this problem seemed to go 
back to the earliest stages of psychosexual development, the prognosis was guarded. 


Summary of Analysis of Interpersonal Family Dynamics. In the clinician’s eyes, both parents 
seemed weak and ineffectual in their interpersonal mechanisms as evidenced in the interview, 
by the ease with which Helen manipulated them, and by the father’s apparent vocational in- 
stability and lack of direction. This was quite a different picture from their own self-perceptions. 
They did not see one another or themselves as weak, but as fitting in with cultural expectations of 
adequate authoritativeness and responsibility. 

Consciously, Mrs. D. saw herself as sympathetic and forgiving, responsible and generous. 
She saw herself as having little in common with her daughter, whom she saw as calculating, think- 
ing only of herself, self-satisfied, boastful, and rebellious. The father, on the other hand, though 
he recognized rebelliousness in his daughter, emphasized this much less than the mother, and was 
more prone to see good qualities such as leadership and even “making a good impression on 
others” as descriptive of her. His perceptions of Helen were rather similar to his consciously 
reported perceptions of himself. 

Helen apparently saw the mother as alternating between tender, sympathetic, and soft- 
hearted behavior on the one hand, and aggressive, rejecting, punitive, and hypercritical behavior 
on the other. She saw the father as being capable of aggressive outbursts, but as predominantly 
cooperative, affectionate, and understanding. Helen regarded herself as being the victim of her 
mother’s rejection and the ‘darling’ of her father. Her self-perceptions, though, seemed to be 
more a reflection of her mother’s behavior toward her: extreme vacillation between love and 
hate. (Helen wrote frequent notes to her parents which were almost invariably signed, “Love or 
hate, Helen.”) 

Docile-dependent attributes in all the family members were conspicuous by their absence 
at the level of consciously reported perceptions. However, a pattern of dependency on an un- 
conscious level was present in all members of the family. Dependency was particularly strong 
and important in Helen’s case. She behaviorally expressed dependency to a marked degree, but 
mainly when it could be expressed in a rebellious, aggressive context; that is, when it ran counter 
to the immediate wishes of the parents, as when they wished to go out for an evening, or when 
they wanted Helen to be independent and fulfill her social and cultural obligations. Thus, her 
dependency was rarely expressed in such a way as to draw the reciprocal nurturant and warm 
responses from the parents, particularly the mother. The reasons for the lack of direct and appro- 
priate expression of dependency seemed to be Helen’s hostility and competitiveness toward her 
mother, which also demanded expression, and the fear that her mother’s response to a direct bid 
for nurturance would be overprotection and domination. The closest she could come to “pulling” 
the nurturant responses, unmixed with hostility, from the mother was through feigned and/or 
psychosomatic illness. Through this behavior she could continue to be the one in control while 
receiving sympathy and warmth. Moreover, she could terminate the situation when protection 
threatened to become overprotection, simply by becoming well again. 

A rejecting reaction to Helen’s rebellion-cloaked dependent behavior was made all the more 
likely and powerful because of Mrs. D.’s image of what she should be like as a mother. Her ideal 
was to be a strong, nurturant, and protective mother, which involved the wish and expectation 
for reciprocal behavior of obedience and respect, trust and acceptance, from Helen. She wished 
Helen were dependent upon her for emotional satisfaction and guidance. Helen’s deviations from 
this idealized picture produced extreme vacillation of feeling and behavior in the mother. She 
alternated between self-effacing and self-derogatory behavior at one moment, when she felt that 
she had been deficient in living up to this image, and resentment and aggressive behavior the next 
moment, when she placed the blame for destroying this ideal on Helen. 

The pattern, of course, was sometimes initiated from the other side as well, with the mother 
the first to show rejecting or punitive behavior toward Helen, calling forth a response of counter- 
rejection and rebellion, which, in turn, reinforced the mother’s negative response. It was often 
some demonstration of healthy independence on Helen’s part that called forth Mrs. D.’s initial 
critical response. It was assumed that the mother’s distorted interpersonal reflex patterns and 
conflicting interpersonal motivations greatly contributed to the development of the inadequate 
pattern of interpersonal reflexes later shown by Helen in this area. But the counter-reflexes 
developed by Helen made it much harder than it originally would have been for Mrs. D. to ex- 
press warmth without domination, and therefore harder to break the vicious cycle. 

Docility and dependence were important characteristics of Mr. D. While he did not perceive 
these aspects in himself, he probably expressed them most directly and openly of the D.’s. Faced 
with one of Helen’s demanding moods, Mr. D. would usually leave the house rather than take 
an active part in attempting to control Helen or join his wife in aggressive harangues. Mr. D. 
occasionally did control Helen with angry, punitive outbursts, but most of the time he was 
virtually helpless in preventing her from taking advantage of him. This was apparently the case 
not only because of his docility, but because of dependence upon her for gratification of his concept 
of himself as a kind, generous, understanding, sympathetic person in general, and parent in 
particular. Dependency upon his wife was more pronounced. He depended on her not only in 
the emotional area, but in more practical matters as well, e.g., budgeting, fulfilling social obliga- 
tions, and handling problems with neighbors, in-laws, etc. While Mrs. D. and Helen were usually 
the antagonists in the family dissensions, Mr. D. seemed to contribute to the high level of conflict 
if only because of his general ineffectiveness in the interpersonal area and because his dependency 
on both would not permit him to take sides. With no clear allegiance to either one, he remained 
the target for and object of both mother and daughter’s respective efforts to gain the dominant 
position. 

Helen’s interpersonal reflexes were the most limited, automatic, and overdeveloped, and by 
virtue of this fact it was she who dictated the nature of most of the family’s interaction. The ease 
with which Helen could step into the position of dominance and control over her mother, and the 
frequency of the mother’s descent to the level of using child-like recriminations and arguments 
with her daughter lent an element of reality to Helen’s Oedipal fantasies. Helen’s confusion in 
sexual role and her masculine strivings were seen as emerging from repeated reinforcement for 
expressing authoritarian and dominating interpersonal reflexes, from hostility engendered toward 
the mother in the blind struggles mentioned above, from the inability to “pull” warmth and 
sympathy from her mother, and from the identification the father felt for Helen, which gave rise 
to a reciprocal identification on her part. 


Therapy. The interpersonal dynamics seemed to be such as to present an interesting case for 
joint mother-daughter therapy sessions. It was felt that this might present the quickest route to a 
breakup of the interpersonal misunderstandings and vicious cycles. Joint therapy sessions in- 
cluding the father were not possible because of his employment. The mother and daughter were 
seen together by a single therapist, followed immediately by separate therapeutic sessions for each, 
again with the same therapist. Discussion of therapeutic techniques would not be appropriate in 
this paper, but it does seem appropriate to give some form of closure to the reader by reporting 
that it appeared to the therapist that the joint interviews provided a unique opportunity for 
mother and daughter to express their deeper feelings about each other directly to each other in a 
situation allowing for therapeutic intervention. In addition, the joint sessions seemed to serve as 
a catalytic force in promoting the working out of the feelings and conflicts in the individual 
sessions. The interpersonal themes mentioned above were very dramatically displayed and 
worked through in the therapy. The case was closed after a total of 22 forty-minute individual 
sessions for each, and 17 twenty-minute joint sessions. The therapist’s opinion, test results, and 
the family’s reports at that time indicated that, in addition to a cessation of the presenting prob- 
lems, healthy patterns of communication and interpersonal behavior had largely replaced the 
former pathological ones. There was similar evidence that Helen’s problems of sexual role identification 
had been satisfactorily resolved. Whatever interpersonal and intrapersonal conflicts remained 
no longer seemed outside of the family’s capacities to work through without further 
professional help. 


SUMMARY 


In this paper the potency of traditional psychological orientations and tech- 
niques was questioned in the light of recent attempts to understand and treat fam- 
ilies as a unit, as a group in psychological interaction. The system of interpersonal 
diagnosis as developed by Leary was viewed as a system of great promise for this 
purpose. The case of a nine year old girl who refused to go to school, had fears of 
death, fears of growing up, and was confused in sexual role, was presented in tradi- 
tional fashion, and as a family study utilizing some concepts from Leary’s inter- 
personal system, in order to illustrate the utility of these concepts for understanding 
family dynamics in action. 


REFERENCES 
1. Ackerman, N. W. The Psychodynamics of Family Life. New York: Basic Books, 1958. 
2. Goolishian, H. A. A Brief Psychotherapy Program for Disturbed Adolescents. Paper read at 
Amer. Orthopsychiat. Assn., Chicago, 1960. 
3. Leary, T. Interpersonal Diagnosis of Personality. New York: Ronald Press, 1957. 
4. Rapoport, Rhona. The Family and Psychiatric Treatment: a conceptual approach. Psychiat., 
1960, 23, 53-62. 
5. Schuster, F. P., Jr. Summary Description of Multiple Impact Psychotherapy. Texas Rep. on 
Biol. Med., 1959, 17, 426-430. 
6. Smith, B. K. A Family Grows. Austin, Texas: The Hogg Foundation for Mental Health, The 
Univ. of Texas, 1959. 





INDIVIDUAL THERAPY WITH A “DEFECTIVE DELINQUENT” 
INTRODUCTION 


The elusive concept that the so-called “defective delinquent” is not amenable 
to rehabilitation is still a widely held opinion among institutional administrators. 
Westwell, for example, quotes one superintendent as recommending that “the 
defective delinquent” should be triple locked up and permanently “taken out of 
circulation.” Others have expressed the view that psychotherapeutic intervention 
is unlikely to eliminate delinquent behavior. Thus, Lurie et al. concluded that 
“prognosis with regard to cure of behavior difficulty is uniformly poor.” In recent 
years, Sarason, Stacey and Thorne have expressed a more positive opinion 
regarding psychotherapeutic endeavors with mental defectives in general and 
defective delinquents in particular. While environmental therapy has been stressed 
and documented in some English studies, and group therapy results have 
been published in this country, there seems to be a paucity of reports of individual 
psychotherapy. This paper offers a case study with a defective delinquent 
who was rehabilitated through intensive individual psychotherapy. The subject was 
selected by the staff of a large Colony for male defectives as belonging to the group 
of “incorrigible” defective delinquents. The procedure of selection and the results of 
a large scale counseling and psychotherapy program have been described elsewhere. 
One to three therapeutic sessions of 20-50 minutes each were held weekly 
for 18 months except for vacation. 


CASE REPORT 


Peter is a twenty-four year old single man. Both parents were born in Russia, 
had no formal education and came to this country after World War I. Father died in 
1942. Mother died in 1954. The marriage was reported as a stormy one. His 
mother was considered a good housekeeper but completely unsuccessful in control- 
ling her son; her attitude was one of both rejection and over-indulgence. Two sib- 
lings are college graduates. Pre-natal and developmental history is sketchy and not 
reliable. Peter was an unwanted child. His birth was normal and he had generally 
good health except for pneumonia at one year of age. School attendance began at 
the normal age but he was excluded almost immediately. Since entire family life was 
upset by Peter’s presence in the home, school was reapproached but again resulted 
in exclusion because he was “unmanageable, inattentive, deliberately disobedient 
and resorted to attention-getting in the classroom situation.” Following his second 
exclusion from school, it was decided that Peter should be tried in a foster home 
placement. He was placed in 1942 but ran away and it proved physically impossible 
to contain him. In October, 1943, Peter entered a residential treatment center for 
disturbed delinquents and truants in New York State. While there he continually 
invited punishment from his fellow inmates. He nagged, provoked and used all 
possible devices to direct attention toward himself. His aggressiveness was mani- 
fested by clownish, buffoonlike bantering and persistent demands and questions. 
He bit his nails severely, had a voracious appetite and wet the bed several times a 
night. Psychotherapy was initiated and immediately resulted in a marked change 
in social manners, improved school adjustment and less severe nail biting and enuresis. 
Unfortunately, the therapist had to leave and Peter could not adjust to his 
new therapist. He became explosive and destructive, and on one occasion jumped 
on her, wrapping his legs around her body and kissing her. The staff felt that the institution 
could not cope with Peter’s destructive and disruptive behavior and recommended 
transfer to a psychiatric hospital. The mother did not agree and took the 
boy home in December, 1946. 
At home Peter refused to obey his mother and tried in every way to annoy her. 
He would run outside and ring the doorbell until she became extremely upset. Since 
the mother was unable to cope with him, admission to a Residential Treatment 
Center in New Jersey was arranged. The diagnostic impression given after his admission 
in April 1950 was “Disorder of personality due to epidemic encephalitis.” 
The exhibited behavior difficulties were in the form of constant talking, interfering 
with the activities of other children, hyperactivity, provocative behavior, etc. He 
also on numerous occasions felt the breasts of women supervisors and of strange 
women visiting the Center. Peter responded poorly to the various therapies insti- 
tuted for him and on reaching the Center’s age limit was released in February 1950. 
A subsequent evaluation at the New Jersey Diagnostic Center resulted in a diagnosis 
of “psychopathic personality with sexual deviations, compulsive type, possibly 
associated with organic encephalopathy.” Peter was transferred to a reformatory. 
In his four years there, he seemed unable to fit in any program and was regarded as 
“our most abnormal boy.” Constant violation of the regulations resulted in segregation 
and punishments. He was paroled April 1954 but because of voyeuristic activities, 
drinking, etc., parole was revoked and Peter was recommitted four months 
later to the State Colony for Defectives. During the first two years his behavior was 
similar to that experienced before, with some further decline noted. Psychological 
testing resulted in IQ ratings of 72, 51 and 57 for the Verbal, Performance and Full 
Scale of the Wechsler-Bellevue. The Wide Range Achievement Test showed the 
following grade equivalents: Reading 5.2, Spelling 4.8 and Arithmetic 3.1. Until 
1957 Peter spent most of his time in the detention cottage, repeatedly sent there for 
breaking institutional rules, and for homosexual and aggressive acting-out behavior. 


THE COURSE OF PSYCHOTHERAPY 


After reviewing Peter’s case during the Spring 1957, the writer established a 
therapeutic relationship with the patient. The first six months of individual psychotherapy 
produced little noticeable effect. Peter continued his behavior pattern and 
made frequent trips to the detention cottage where therapy was continued. He 
verbalized feelings at that time that he was a bad boy and nothing would make him 
change his ways, that this was the way he wanted to be, and he didn’t care whether 
his behavior would result in a complete and eternal confinement to the detention 
cottage or some jail. However, at the end of the six month period, it seemed that 
the very permissive atmosphere which the therapist created for Peter and the fact 
that the therapist did not lose interest at the time of his difficulties with institutional 
administration had finally made an impression on him. He seemed to acquire some 
social skills and ideas and made a promise that he would at least try to change some 
of his behavior patterns, would try to keep himself out of detention, and would take 
an assignment as a worker. Previous to this time Peter was unable to hold any work 
assignments; he had been tried in many training situations throughout the institu- 
tion and had constantly failed. During these first six months, Peter’s behavior in the 
therapy sessions could be characterized as extremely polite and subdued, as he tried 
to please the therapist. He never failed to thank the therapist for his efforts to help 
him, was extremely courteous, and tried to do small favors to gain the therapist’s 
confidence and approval. 

During the second six month period, Peter’s behavior changed considerably. 
He was very proud of his new ability to keep out of mischief and the absence of any 
disciplinary action. He counted every day from his last confinement in detention. 
Now, however, the therapist became an object of Peter’s extreme hostility and ag- 
gression. He came to the sessions accusing the therapist of all kinds of things, telling 
him that everyone in the Colony was trying to help him except the therapist. He 
questioned the value of the hour and the therapist’s ability to help him. He brought 
something new to every session, attempting to discover if this would make the 
therapist feel bad or perhaps reject him. He asked repeatedly what the therapist 
would do if Peter were to attack the therapist’s wife (who at that time was a teacher in the Colony 
school). Quite often he would remain outside the door of the therapist, not coming 
in but also making sure that the therapist would not become occupied with another 
boy or another activity. 

After the three week Summer vacation period, Peter greeted the therapist with 
friendliness. He expressed his sincere feelings of having missed the therapist, talked 
about his own inadequacy and inability to face the problems of the world, and of 
how he really feared going out into a strange and hostile community. This was the 
first time he was able to express hostility toward his deceased mother who had re- 
jected him, placed him in institutions and never really cared about him. Similar 
feelings were directed against other members of his family who would come once- 
in-a-while to visit him or send small gifts, but were not really interested. This period 
was therapeutically very fruitful. He brought his dreams to the session and gained 
quite considerable insight in their meaning. He was also able to discuss every-day 
matters and his behavior toward peers and supervisors. His judgment and insight 
in these situations appeared quite mature and developed more and more as therapy 
continued. The cottage parents’ reports from this period forward showed a marked 
improvement in his behavior and working habits. He was amenable to counseling 
and, probably for the first time in his life, accepted a job and performed very well. 

A schedule was set up for Peter to go home for weekends and gradually for a 
week. It was felt that this might lead to a good relationship with the family, which 
could eventually result in a job placement with supervision from home. It proved a 
very trying period for the brothers. Peter turned all his aggression and hostility 
toward them, especially at the time of the older brother’s marriage. He created 
numerous disturbances during his home visits, running away, annoying everybody, 
stealing money from the younger brother, etc. His behavior was discussed in the 
therapy sessions and interpreted as an attempt to make the brothers reject him, 
which in turn, would give him justification for his delinquent behavior. Peter finally 
gained insight and after a long period of difficulties a satisfactory relationship de- 
veloped between him and his brothers. 

A formal psychological test was administered in the Spring of 1958 and a 
marked improvement in all areas was evident. On the WAIS, the Verbal IQ increased 
from 72 to 76, Performance IQ from 51 to 66, and Full Scale IQ from 57 to 70. 
On the Wide Range Achievement Test, his grade level for Reading went up from 5.2 
to 8th grade and Spelling from 4.8 to 5.1 grade level. The projective tests showed 
that Peter was more comfortable with his home environment and less aggressive. 
The concluding phase of therapy was not too different from the previous one. Peter 
was again able to bring his feelings and his problems to the sessions and discuss them 
in a frank and orderly manner. His behavior improved even more and, as noted 
above, the family contact and family relationship grew stronger and he felt at ease in 
the new setting. He went home more frequently, enjoying the life of the community 
and the relationship with his family. One very important change occurred during 
this last period. Peter was always a ravenous eater and was markedly overweight. 
Beginning approximately at the end of Autumn, he lost close to forty pounds, which 
seemed a good indication of lessened orality, of gaining inner security and balance. 
It was the unanimous opinion of the Colony Staff that Peter had made an outstand- 
ing adjustment; and his behavior on work assignment was excellent and not once 
had it been necessary to confine him to detention. 

Peter was placed in the community approximately two years ago with the 
understanding that he would continue receiving supportive therapy during his com- 
munity life and work. Unfortunately, this did not materialize. During this period 
Peter did not become involved in any delinquent acts; there were no reports of 
sexual misbehavior and, as far as his over-all behavior was concerned, his adjust- 
ment was considered fair. His work record, however, was only marginal. He still 
preferred to live on his brother’s earnings, and was not willing to make a whole- 
hearted effort to take a full time job. There were some suggestions that the writer 
should continue psychotherapy with Peter until a complete readjustment to the 
outside world was achieved but, for technical reasons, this was not feasible. 
CONCLUSION 

Individual psychotherapeutic intervention in the case of a young “defective 
delinquent” was described and it was shown that, contrary to other reports in the 
literature, prognosis can be favorable. It is suggested that institutions and agencies 
dealing with borderline or mildly retarded delinquents should make extensive use of 
psychotherapeutic techniques. The importance of intensive counseling after release 
from the institution was emphasized. Instead of supporting the widely prevailing 
attitude of pessimism and custody, the case presented may offer some hope for re- 
habilitating defective delinquents, helping them to become more productive citizens 
and reducing the burden of society. 
PROBLEMS OF PSYCHOLOGICAL DIFFERENTIAL DIAGNOSIS 
ASSOCIATED WITH HYPOTHYROIDISM: A CASE STUDY 
REBECCA SNYDER 


Veterans Administration Hospital 
East Orange, New Jersey 


INTRODUCTION 

The necessity for recognizing that the psychopathology of patients suffering 
from organic disturbances may simulate very closely disturbances of a functional 
nature is knowledge which is sometimes easily lost sight of when florid manifesta- 
tions of a thought disorder are evidenced and leave no uncertainty about the presence 
of a psychosis. Failure to differentiate organic psychoses from functional psychoses 
before the occurrence of irreversible organic damage deprives the patient of the 
potential benefits of medical intervention which such a differentiation might make 
possible. Asher points out the truism that amelioration of the condition depends 
upon its recognition and describes the psychoses of hypothyroidism as conditions 
which go unrecognized by physicians not infrequently and are misdiagnosed as 
schizophrenia. The disorientation, ideas of reference, and delusions of persecution 
commonly found in such conditions contribute to this confusion. Bleuler was 
aware of the difficulty and described some “rare cases of hypothyroidism scarcely 
distinguishable from schizophrenia.” The Clinical Society of London, too, in 1888 
made known its recognition of the psychotic aspects of myxedema. Asher quotes 
from this report as follows: “Delusions and hallucinations occur in nearly half of the 
cases, mainly where the disease is advanced.” More recently Asher, Cranswick, 
Gregory, and Zondek and Wolfsohn are among those describing “schizophreniform” 
psychoses associated with thyroid hypofunctioning and clearing subsequent 
to thyroid medications. Asher believes that this condition exists much more frequently 
than is recognized. He found that 9 out of 14 psychotic patients referred to 
him within a period of 4 years recovered after thyroid medication although none of 
them had been diagnosed by the referring physician as having hypothyroid condi- 
tions. On the basis of this experience he concluded that there must be many others 
which go undiagnosed because of the lack of specificity of the resultant psychoses. 

The problem of differentiating psychoses associated with thyroid hypofunction- 
ing from functional psychoses is no less confusing for the psychologist since he too is 
confronted with a lack of specificity in the nature of the resultant psychoses. Swartz 
and Ferguson presented a case report of a patient whose psychopathology had 
been, prior to testing, considered by her therapist to be functional while the interpretation 
of the test findings suggested a possible organic dysfunction with the subsequent 
identification of hypothyroidism as “a major organic factor.” The case being 
presented here is that of a patient with an established diagnosis of hypothyroidism. 
A psychological evaluation was requested because of “bizarre behavior” which 
he was exhibiting. His physician raised the question of a possible functional psycho- 
sis. At the time of the referral request, the report of the patient’s previous hospitali- 
zations had not yet been received so that neither the referring physician nor the 
examiner was aware of the patient’s having formerly been diagnosed as suffering 
from paranoid schizophrenia. 


CASE REPORT 


The patient was a 44 year old bachelor whose medical history indicated a condition of hypo- 
thyroidism and hypopituitarism, apparently of longstanding duration. Paranoid ideation was 
florid and had contributed to his having formerly been diagnosed as having paranoid schizo- 
phrenia. Characteristics of his psychological functioning suggested to this examiner the possibility 
of an organic psychosis in which paranoid features were prominent. The occurrence of both an 
organic psychosis with prominent paranoid features and the condition of hypothyroidism suggested 
to the examiner the possibility of a relationship between the two, a “schizophreniform” 
psychosis associated with the hypothyroidism. Since the patient died 16 months subsequent to 
the psychological examination without having shown any remission of his psychosis despite 
thyroid medication, proof of such a relationship was not established. Autopsy findings, however, 
did reveal “atrophy of the adrenals, thyroid and pituitary glands” so that failure of thyroid 
medication to bring about the remission of psychosis could have resulted from the presence of 
irreversible organic dysfunction. We have no way of knowing whether earlier thyroid medication 
might have alleviated his psychotic symptoms. The purpose of presenting this case is to share 
with others the problems of differential diagnosis which arose and to focus attention upon a condi- 
tion which may well be escaping our attention and being categorized as schizophrenia. 


Psychological Examination. The patient was seen on two different occasions within a three day 
time interval and given the following tests: Wechsler Adult Intelligence Scale, Bender-Gestalt 
and Rorschach. Suspiciousness was clearly apparent in the qualitative nature of the patient’s 
initial interaction with the examiner. It was as if he were enveloped by it and withdrawn into 
himself so as to make as little contact as possible with potentially harmful people. He seemed al- 
most mummified. Unlike the typical schizophrenic, however, he seemed gradually to lose some of 
his mistrust and became more outgoing and animated. He even tried to be helpful and at one 
point made a rather pathetic attempt at being humorous. Paranoid ideations, however, sporad- 
ically erupted in very dramatic form. He asked whether the person in the next office who could be 
seen through a window between offices was taking his picture, and on questioning indicated that 
he knew that was one technique that the FBI used when trying to find out about anyone. He 
added that he did not believe this was being done to him. He described his life on the ward as 
one in which he was being “humiliated” by the other patients. Most striking of all, however, was 
his drastic reaction to Card VI of the Rorschach. This evoked in him what he considered “evil” 
thoughts. This he projected upon the cards by calling them “evil” and indicating that the examiner 
was trying to do something evil to him by showing him these cards. He asked whether the 
card had “anything to do with a woman”; then he put it down as if he were being contaminated 
through contact with it. He asked the examiner what she was “driving at” and added when asked 
his thoughts about this, “If it’s what I’m thinking, you’d better stop.” 
Sexual confusion was apparent as he revealed his close identification with a sister who has 
“bladder trouble” and numerous bodily complaints for which her physician could find no organic 
basis. He sent her to a psychologist who indicated that experiences of the past were now 
influencing her. His defensiveness was temporarily gone as he asked the examiner whether she 
thought this might also be true of him. Feelings of guilt and thoughts of evil soon intruded into 
his thinking again and he became almost incoherent as he communicated in fragmented phrases and 
telegraphic style the content of his disturbing thoughts. They concerned his having looked at such 
“evil” pictures in the past, in smokers and men’s club rooms and feeling guilty about it. He spoke 
in agitated manner of the “evil” pictures he had seen and of the need “to confess to the Priest 
whatever is in your thoughts.” “I went to Confession and don’t want no more of these [illegible],” 
he stated with vehemence. He believed that there was “something evil within” him which no one 
could help, neither the physicians nor anyone else. Paranoid ideation, fragmented thinking, loos- 
ening of associations, flattened affect, the ready use of projection, and erratic intellectual function- 
ing with success on some items more difficult than those he failed, indicated a paranoid psychosis, 
resembling paranoid schizophrenia. 

Qualitative rather than quantitative aspects of this patient’s functioning suggested to this 
examiner the probability of an organic rather than a functional psychosis. The following features 
of his functioning seemed atypical of the schizophrenic and more characteristic of the organic: 
dissatisfaction with his own functioning on concrete tasks and the acceptance of the limitations as 
being within him, feelings of inadequacy, perseveration, feelings of perplexity and confusion, 
misuse of words, pattern cohesion, concretistic thinking, a subjective feeling of having slowed 
down in his responses, the spontaneous moments of helpfulness, involvement in the tasks, at- 
tempts at improving his performance, and the gradual emergence of clear thinking from the 
confused state in which direct questions threw him. It was as if he needed more time 
than usual to fully grasp the essence of that which he was being asked. 

His obtained Full Scale IQ of 93 was still within the normal range (Verbal IQ, 96; Performance 
IQ, 91). The erratic nature of the patient’s intellectual functioning was reflected in the 
intra- as well as intertest scatter. His performance on the Digit Symbol subtest was equivalent to 
an IQ of 79. This occurred despite his ready understanding of the nature of the task and was a 
reflection only of his inability to work quickly. There were no errors, simply an insufficient 
number of blocks filled. His highest score was obtained on Block Designs (equivalent to IQ 112). 
Pattern cohesiveness and perseverative tendencies prevented his obtaining an even higher score. 
His approach to both of these subtests of the WAIS impressed this examiner as organic. 

The patient’s handling of the Bender-Gestalt drawings showed no distortions of designs but 
some perseveration. He was concerned with the accuracy of his reproductions and tried to do well. 
Dissatisfaction with his own designs was expressed as were requests for further structuring from 
the examiner. 

The Rorschach was not completed since the patient refused to continue after seeing Card VI. 
Up until that time form level was good and concretistic thinking conspicuous. 

The patient’s psychological functioning thus revealed florid examples of disordered thinking 
such as are characteristic of a paranoid psychosis. Features suggestive of a possible organic im- 
pairment were also present so that the diagnosis of an organic psychosis seemed most applicable 
to the total picture. The organic manifestations were, however, much less dramatic and con- 
spicuous clinically than the paranoid ones so that the patient’s behavior in many ways so closely 
resembled that of a paranoid schizophrenic as to raise the question of which was primary or 
whether the two conditions coexisted with no causal relationship between the two. A careful and 
accurate history about the nature of the patient’s personality changes could have been helpful 
in attempting to resolve this problem, but there was no reliable source from which to obtain this. 
The history which he gave indicated a thyroid condition of long standing and the possibility of 
irreversible changes having occurred was recognized. 


SUMMARY 


The problem of differential diagnosis between organic psychoses associated 
with hypothyroidism and schizophrenia is focused upon in this case presentation 
of a patient with paranoid psychosis and a severe hypothyroid condition. It is sug- 
gested that the organic psychosis with paranoid features such as has been described 
might have been associated with the hypothyroidism. Definitive evidence for this 
was not however established since thyroid medication failed to bring about a re- 
mission of the psychosis. 

PROBLEM 

Considerable IQ discrepancies in groups of retardates have been obtained between the Leiter 
International Performance Scale (LIPS), the Stanford Binet (SB) and the Wechsler Verbal 
(WAIS-V) and Performance (WAIS-P) scales. In some experiments, Performance IQ was superior, 
in others Verbal IQ. Variation in intelligence level was thought to contribute significantly 
to these discrepancies. This study investigated differences between the above tests over a wide 
IQ range. The following sources of variation are considered: test difficulty in terms of overall 
differentiation between tests; nature of test in terms of a Performance/Verbal test dimension; 
intelligence level in terms of five grades maximally varying as to IQ; and, finally, kind of 
intelligence level in terms of three level criteria varying along a Performance/Verbal dimension. 


METHOD 


Subjects. Seventy-five retardates, who completed all tests, were drawn from the 
resident helper male population of Woodbine State Colony. The main selection 
criterion served to produce greatest possible IQ variation. 


Tests. The tests selected are the LIPS, a pure performance scale; the SB (Form L), a mixed 
performance/verbal scale; and the WAIS V and P subscales, which are dealt with separately 
because of the differential effects between verbal and performance tests. The sample means, 
with standard deviations in brackets, are as follows: LIPS = 47.40 (14.14), SB = 50.59 (11.55), 
WAIS-V = 60.03 (7.92), WAIS-P = 61.69 (14.30), and WAIS Full Scale = 58.47 (10.56). They 
increase in the order LIPS - SB - WAIS, as expected from the literature. 

The correlations between the four test scores are as follows: LIPS vs. SB = 0.77, 
LIPS vs. WAIS-V = 0.59, LIPS vs. WAIS-P = 0.75, SB vs. WAIS-V = 0.79, SB 
vs. WAIS-P = 0.78, and WAIS-V vs. WAIS-P = 0.71. With one exception (LIPS 
vs. WAIS-V), these coefficients are fairly uniform. 

For cross-validation of the Wechsler Performance/Verbal discrepancy effects, 
described in a separate paper, a score of WAIS-P minus V (P-V) was derived. 
The present mean is 1.67 (SD = 10.52). 

Chronological age. Age varied considerably about the mean of 36.91 years, SD 
being 11.88. Correlations between this factor and the test scores are as follows: 
LIPS = −0.12, SB = 0.00, WAIS-V = 0.13 and WAIS-P = 0.26. Only the latter 
coefficient is significant at the 5% level, indicating that age plays no significant role 
in the present analysis. 
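
The significance statements about the age correlations can be checked with the usual t test for a correlation coefficient. The following is a minimal sketch, not the authors' computation: the function name `t_from_r` and the critical value of about 1.99 (two-tailed 5% point for 73 degrees of freedom) are assumptions introduced for illustration.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """t statistic for testing a Pearson r against zero (n - 2 degrees of freedom)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


N = 75  # sample size given under Subjects
T_CRITICAL = 1.99  # assumed: approximate two-tailed 5% point of t with 73 df

# Correlations of chronological age with each test, as reported in the text.
age_correlations = {"LIPS": -0.12, "SB": 0.00, "WAIS-V": 0.13, "WAIS-P": 0.26}

for test, r in age_correlations.items():
    t = t_from_r(r, N)
    verdict = "significant" if abs(t) > T_CRITICAL else "not significant"
    print(f"{test}: r = {r:+.2f}, t = {t:+.2f} ({verdict} at the 5% level)")
```

Under these assumptions only the WAIS-P coefficient exceeds the critical value, in agreement with the text.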

Procedure. The tests were administered in a balanced order to control for se- 
quence effects. The Binet and Wechsler tests were scored in the conventional man- 
ner. The scoring procedure for the LIPS was as suggested in a personal letter by Dr. 
R. G. Leiter: MA multiplied by 100 and divided by a constant of 156 with 5 IQ 
points added (for Ss of more than 13 years). This procedure differs somewhat from 
that described in the manual. 
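
The LIPS scoring rule quoted above can be written out as a short function. This is a sketch under stated assumptions: mental age is taken to be expressed in months (the constant 156 equals 13 years in months), and the function name `leiter_iq` and its flag are hypothetical.

```python
def leiter_iq(mental_age_months: float, over_13_years: bool = True) -> float:
    """LIPS IQ as described in the text: MA * 100 / 156, plus 5 points for Ss over 13."""
    iq = mental_age_months * 100.0 / 156.0
    if over_13_years:
        iq += 5.0  # the 5-point addition is stated only for Ss of more than 13 years
    return iq


# Hypothetical subject with a mental age of 6 years 6 months (78 months).
print(round(leiter_iq(78), 1))
```

For this hypothetical subject the ratio part is 50 and the adjusted IQ is 55.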

The criteria for the Performance/Verbal dimension were the LIPS, the rank order combined 
WAIS-V: LIPS and the WAIS-V score. The first and last score differ most with regard to both 
the Performance/Verbal dichotomy and intercorrelation. Variation in intelligence level was 
effected by dividing the sample into non-overlapping quintiles on the basis of each of the 
three criteria of the Performance/Verbal dimension (Table 1, quintiles L to H). 
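
The division into non-overlapping quintiles can be sketched as follows. The scores are invented for illustration, the level labels L, ML, M, MH and H follow the notation used in the Results, and the handling of tied scores (here, by original order) is an assumption, since the paper does not describe it.

```python
LEVELS = ["L", "ML", "M", "MH", "H"]


def split_into_quintiles(scores):
    """Return five equal, non-overlapping groups of subject indices, lowest scores first."""
    n = len(scores)
    if n % 5 != 0:
        raise ValueError("sample size must be divisible by 5")
    order = sorted(range(n), key=lambda i: scores[i])  # ties keep their original order
    size = n // 5
    return {label: order[k * size:(k + 1) * size] for k, label in enumerate(LEVELS)}


# Hypothetical criterion scores for 75 subjects (not the study's data).
scores = [20 + (i * 37) % 75 for i in range(75)]
groups = split_into_quintiles(scores)
for label in LEVELS:
    members = [scores[i] for i in groups[label]]
    print(label, len(members), min(members), max(members))
```

With 75 subjects each level contains 15 cases, which matches the N = 15 on which the means in Table 1 are based.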


RESULTS 


The main findings are given in Table 1. Test means are shown as a function of the five 
intelligence levels and three intelligence criteria. The slope of the individual test 
gradients, as determined by the quintile means, is steepest for LIPS and WAIS-P, the two 
performance scores, and flattest for WAIS-V, a verbal score. The gradient slope is further 
dependent on the kind of IQ level criterion. It increases in the order Verbal - combined 
Verbal/Performance - Performance criterion. 


TABLE 1. TEST MEANS AND t-RATIOS BETWEEN MEANS AS A FUNCTION OF FIVE INTELLIGENCE 
LEVELS (SAMPLE QUINTILES) 

L = low, M = medium, H = high intelligence level. All means based on N = 15. DF = 280 
(Remainder of Analysis of Variance). 

Significance level: * = 5%, ** = 1%, *** = 0.1% 


Significance of these results was assessed by means of analysis of variance. 
F-ratios for the WAIS-V criterion were as follows: Tests = 51.5, level of intelligence 
= 84.0, and T × L = 0.5; for the combined WAIS-V: LIPS criterion: T = 74.8, L = 
148.2, and T × L = 2.5; for the LIPS criterion: T = 63.7, L = 106.6, and T × L = 
5.1. The DF are: T = 3, L = 4, T × L = 12, and Remainder = 280. The F-ratios 
for tests are significant at the 0.1% level. The test by level interaction (variation in 
gradient slope) proves significant and significantly dependent on the level criterion. 
Along the criterion dimension, this interaction increases from insignificance (Verbal 
criterion) to a 0.1% level of significance (Performance criterion). 

We may now proceed to test the significance of more specific effects using t-ratios 
between all tests for IQ levels as shown in Table 1. The respective Mean 
Squares of the Remainder served as variance estimates. They are 71.4 for the WAIS- 
V criterion, 49.2 for WAIS-V: LIPS, and 57.8 for LIPS. 
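
A t-ratio of this kind can be sketched with the conventional formula for two means that share an error variance. The paper does not print its exact formula, so the expression below is an assumption; the inputs are the LIPS-criterion Mean Square of 57.8, the N of 15 per mean, and the 26.9-point LIPS vs. WAIS-V difference reported under Point 1.

```python
import math


def t_ratio(mean_difference: float, ms_remainder: float, n_per_mean: int) -> float:
    """t = difference / sqrt(2 * MS / n), using the Remainder Mean Square as variance."""
    standard_error = math.sqrt(2.0 * ms_remainder / n_per_mean)
    return mean_difference / standard_error


# LIPS vs. WAIS-V at the L level of the LIPS criterion (values quoted in the text).
t = t_ratio(mean_difference=26.9, ms_remainder=57.8, n_per_mean=15)
print(round(t, 2))
```

On this assumed formula the ratio is a little under 10, far beyond any conventional significance level for 280 degrees of freedom.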


Point 1. Significant differentiation is obtained between any combination of two tests and 
at most points of the analysis. More than two-thirds of the t-ratios are significant at the 5% level, 
the average ratio being 3.3. The significance of differentiation decreases steadily from low to 
high IQ level, as follows: L = 4.35, ML = 3.51, M = 2.98, MH = 2.82 and H = 2.78 (t-ratio 
means), i.e., means differ most widely at the lowest IQ level. The mean difference between the 
LIPS and WAIS-V, for example, amounts to 26.9 IQ points (Table 1, L level of LIPS criterion). 

Point 2. The difference in gradient slope produces a significant crossing-over effect between 
the two Wechsler Scales. The Verbal IQ is significantly higher than the Performance IQ at the L 
level, while the opposite applies for the H level. This confirms the result of a separate 
investigation. More pronounced crossing-over effects would, of course, have been obtained if 
sample means between tests had been identical, particularly in the case of the LIPS/WAIS-V 
interaction. 

Point 3. For the Verbal criterion, average differentiation between tests is very similar for all 
IQ levels, and the otherwise characteristic Performance/Verbal effects are almost neutralized. 
As to the remaining two level criteria, effects specific to the Performance/Verbal dimension be- 
come more pronounced and differentiation, particularly at low levels, increases. This result 
throws light on the relative efficiency of intelligence tests in discrimination between individuals. 


Wechsler Performance-Verbal discrepancy. Previously obtained relationships 
between P − V and intelligence are practically duplicated. P − V correlates 0.40 
with LIPS, 0.49 with SB and 0.63 with WAIS (Full Scale). All coefficients are sig- 
nificant at the 0.1% level (N = 75). Relative or absolute superiority of Perform- 
ance, therefore, correlates positively and significantly with intelligence, regardless 
of the intelligence test used. 


Chronological age, again, correlates positively with P − V, the coefficient of 
0.23 just reaching the 5% level of significance. Age and WAIS Full Scale, from which 
the P − V score is derived, are themselves correlated to a degree of 0.26. Partialling 
out intelligence, the correlation between age and P − V is 0.09, not significant. 
However, WAIS Full Scale correlates 0.61 with P − V after partialling out the 
effects of age. This suggests that, with intelligence held constant, age does not 
correlate significantly with P − V. 
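
The two partialled coefficients follow from the three zero-order correlations given in the text by the standard first-order partial correlation formula. The sketch below reproduces them; the function and variable names are illustrative only.

```python
import math


def partial_r(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))


# Zero-order correlations reported in the text.
r_age_pv = 0.23  # age vs. P - V
r_age_fs = 0.26  # age vs. WAIS Full Scale
r_fs_pv = 0.63   # WAIS Full Scale vs. P - V

age_pv_given_fs = partial_r(r_age_pv, r_age_fs, r_fs_pv)
fs_pv_given_age = partial_r(r_fs_pv, r_age_fs, r_age_pv)

print(round(age_pv_given_fs, 2), round(fs_pv_given_age, 2))
```

Rounded to two places the results are 0.09 and 0.61, the values stated in the paragraph.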


DISCUSSION 

Some IQ discrepancies are apparently due to overall task difficulty rather than 
kind of test items used. A comparison between the LIPS and WAIS-P scales shows, 
for example, that differences are fairly uniform for all conditions represented in 
Table 1. Differences of this kind may be easily eliminated by adjusting scoring 
procedures. 

Other discrepancies appear dependent on the kind of test, whether Verbal or 
Performance. Differentiation between individuals is best for the Performance scores, 
as indicated by their greater steepness in gradient slope. Even the use of a Verbal 
score as criterion of IQ level (WAIS-V) does not lead to superior individual differen- 
tiation of Verbal over Performance scores. This may be due to item heterogeneity of 
the WAIS-V or the SB. Some items may be more strongly accumulative with regard 
to Verbal and others with regard to Performance intelligence. The discussed P − V 
discrepancies could, of course, be removed by analysing both verbal and perform- 
ance items against the same intelligence criterion. However, in the interest of more 
specific measurement it is suggested that such differences be maximized rather than 
concealed or, in other words, that specific rather than general aspects of intelligence 
be emphasized. Efficient prediction of other criteria of adjustment and/or 
personality may greatly depend on specialization and purification of test scores. 

The comparison of pure Performance and Verbal tests may possibly lead to new 
insight into the problem of trainability and/or educability of the retardate. It may 
be contended that, as Performance falls increasingly short of Verbal capacity, 
trainability decreases. The crossing-over point, where superiority of Performance or 
Verbal IQ is established, may perhaps serve as a cutting point for the prediction of 
trainability. To investigate this problem, more refined Performance and particularly 
Verbal measures of intelligence are required. This includes the construction of tests 
capable of discriminating more sensitively between moderately and severely 
retarded individuals. The LIPS appears a promising step in this direction. 


SUMMARY 

The Leiter International Performance Scale, the Stanford-Binet (Form L) and 
the Wechsler Verbal and Performance Scales were administered to 75 male retard- 
ates. Differences between tests were analyzed as a function of intelligence level, 
there being three level criteria: a Verbal, a combined Verbal/Performance and a 
Performance criterion. Considerable differences were obtained between all tests. 
The degree of differentiation was complexly determined by test difficulty, kind of 
test (whether Verbal or Performance), intelligence level and kind of level criterion. 
Difficulty was assumed to simply increase overall differentiation. The 
Verbal/Performance test dimension interacted significantly with intelligence level, Verbal 
IQs being superior at low levels and inferior at high levels. The degree of this inter- 
action, again, is dependent on level criterion. With a verbal criterion, the interaction 
was insignificant, while a Performance criterion produced greatest and highly sig- 
nificant interaction. 


ANXIETY AS “EFFECT OF UNCERTAINTY”: AN EXPERIMENT 
ILLUMINATING THE OA SUBTEST OF THE WAIS 
EILEEN S. LANFELD AND D. R. SAUNDERS 


Educational Testing Service, Princeton, N. J. 


INTRODUCTION 


A factor analysis of split-half scores from the Wechsler Adult Intelligence 
Scale (WAIS) did not yield any factor clearly identifiable with Object Assembly 
(OA), although eleven factors were extracted and all the other subtests except 
Comprehension exhibited reliable unique variance. However, a subsequent study 
focused on the items of the Picture Completion (PC) subtest did produce one factor 
clearly related to OA. Therefore, it seems likely that Object Assembly confounds at 
least two variables: a “performance” component held in common with Block 
Design (BD) and, as Rapaport has suggested, an anxiety component held in 
common with certain PC items. Further verification of an anxiety component in OA 
may be sought by devising other measures that should correlate with it especially 
well. Most recently, this component has been found in a broader context of tests 
including the Afrikaans Wechsler, but still has not been shown to involve any 
non-“performance” tests. 

We have hypothesized specifically that the potentially anxiety-provoking 
feature of the OA task is that the subject is not given any model to copy, so that he 
must manipulate and organize the pieces of each puzzle in order to learn what it 
makes. If this ignorance proves bothersome to the subject, we may call him anxious. 
If this “effect of uncertainty” hypothesis is correct, then the similar verbal task 
embodied in the Sentence Arrangement Test to be described below should correlate 
with OA and PC but not with BD. 


PROCEDURE AND RESULTS 


The Sentence Arrangement Test (SA) and the Picture Arrangement subtest of 
the WAIS are very similar in their mechanics of administration. However, instead of 
containing a picture, each of the cards laid before the subject in SA has from one to 
three words printed on it in large capital letters.³ The cards for each item are 
presented in a standard random order, and the subject’s task is to arrange them in 
sequence so the words ‘‘say something.’’ The time required to reach a grammatically 
correct arrangement provides the basis for scoring. In a pilot experiment, using six 
adult subjects, we observed that taking logarithms (to the base 10) resulted in de- 
tection of a linear relationship between length of sentence and time required for 
solution. 
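
The pilot observation, that log10 of solution time is roughly linear in sentence length, can be illustrated with an ordinary least-squares line. The timings below are invented for the illustration and are not the pilot data; `fit_line` is a hypothetical helper.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical items: sentence length in words and solution time in seconds.
lengths = [8, 10, 12, 14, 16, 19]
seconds = [12.0, 17.0, 24.0, 34.0, 48.0, 80.0]

log_times = [math.log10(s) for s in seconds]
slope, intercept = fit_line(lengths, log_times)
print(f"log10(time) = {intercept:.3f} + {slope:.3f} * length")
```

Taking logarithms before fitting treats each added word as multiplying the expected time by a roughly constant factor, which is one way to read the linear relationship the authors report.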


Experiment 1. Hypothesis: SA correlates with over-all ability as well as do any of 
the regular WAIS subtests. 

Fifteen items, ranging in length from eight to nineteen words (four to ten cards), 
were administered to twenty-one new subjects, aged eighteen to twenty-five, each 
of whom had previously taken the WAIS. The subjects’ normal levels (NL) on the 
WAIS⁴ ranged from 9 through 18. 

The product-moment correlation coefficient between NL and total log-time for 
this sample was —.78. This result is highly significant statistically, indicates a 
marked relationship between SA performance and intelligence as measured by the 
WAIS, and is in line with the hypothesis. 

Analysis of variance of the 15 by 21 table of log-time scores showed that sentence 
characteristics other than length can play a significant role in performance on 
SA. By plotting item difficulty against sentence length, we found that two of the 
fifteen items were abnormally difficult. These items were “/ He had / the knowledge 
/ that what / we love we / may also / despise. /” and “/ About all / her problems, 
/ Carol, at / sixteen, was / not in / the least / worried. /” We interpreted 
the latter as an artifact caused by the complex punctuation, and the former as 
possibly due to the potentially threatening content of the item. 


Experiment 2. Hypothesis: SA’s correlations with WAIS subtests, particularly PC, 
BD, and OA, are as stated in the introduction. 

The total log-time score on SA was correlated separately with each of the 
WAIS subtests, using the data of experiment 1. In order to remove the effect of 
general intelligence from these correlations, we first converted the subtest scores 
into deviations from NL for each subject, and these deviations were correlated with 
SA by the product-moment formula. The resulting values are shown in Table 1. 
I, OA, and PC are the only subtests significantly related to SA when holding NL 
constant. Both OA and PC have been shown elsewhere to involve the “effect of 
uncertainty” factor, which we deliberately incorporated in Sentence Arrangement. 
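
The deviation scores used in Experiment 2 can be sketched from the footnote definition of NL (the sum of the weighted scores for I, C, D, A, S, PA, PC and BD, minus the weighted score for OA, divided by 7). The subject profile below is hypothetical, and the helper names are introduced only for this illustration.

```python
import math

# Subtests entering NL with a positive sign, per the footnote defining NL.
NL_PLUS = ("I", "C", "D", "A", "S", "PA", "PC", "BD")


def normal_level(wts: dict) -> float:
    """NL = (sum of the eight weighted scores above, minus the OA score) / 7."""
    return (sum(wts[name] for name in NL_PLUS) - wts["OA"]) / 7.0


def deviations_from_nl(wts: dict) -> dict:
    """Express every subtest score as a deviation from the subject's own NL."""
    nl = normal_level(wts)
    return {name: score - nl for name, score in wts.items()}


def pearson_r(xs, ys) -> float:
    """Product-moment correlation, used to relate each deviation score to -SA."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical subject: flat profile of 10 except for a high OA score.
subject = {name: 10 for name in NL_PLUS + ("OA", "DS")}
subject["OA"] = 13
print(round(normal_level(subject), 2), round(deviations_from_nl(subject)["OA"], 2))
```

In the experiment each such deviation (for example OA minus NL) would be collected across the 21 subjects and passed, with the negated SA log-time totals, to a correlation routine such as `pearson_r`.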


³Examples: / For the / tenth time / he got / up and / looked out / of the / window. In the 
morning I / bought three / tickets for / the two- / o’clock bus / to town. / A complete list of the 
SA items employed may be found in [reference illegible]. 

⁴NL equals the sum of weighted test scores (WTS) for I, C, D, A, S, PA, PC, and BD, minus 
WTS for OA, all divided by 7. For further explanation see [references illegible]. 

TABLE 1. CORRELATIONS OF SA WITH WAIS (N = 21) 


(I − NL) vs −SA:  .53**        (PA − NL) vs −SA:  [illegible] 
(C − NL) vs −SA:  .00          (PC − NL) vs −SA:  [illegible] 
(D − NL) vs −SA:  .08          (BD − NL) vs −SA:  [illegible] 
(A − NL) vs −SA:  .35          (OA − NL) vs −SA:  [illegible] 
(S − NL) vs −SA:  .14          (DS − NL) vs −SA:  [illegible] 
**Significant beyond the 1% level (one-tailed test). 
*Significant beyond the 5% level (one-tailed test). 





The high correlation with Information is appropriate for a verbal test and serves to 
replace the high correlation typically encountered between OA and BD. 


Experiment 3. Hypothesis: Performance on SA may be adversely influenced by 
employing sentences incorporating indirect psychological threat. 

Thirteen items were administered to nineteen new subjects, aged eighteen to 
twenty-five. The items included eight well-behaved items from experiment 1, the 
two deviant items, and three new items: “/ If / you / believed that / he / might 
shoot / our mother / I / would give in / right away. /” “/ Since / money / is strong 
and / God / is even / more powerful, / there / is nothing / to fear. /” “/ Really, / 
when / we are / alone / together / I cannot / take time / for sleep; / what / should 
I / do? /” The latter three items were newly written in a deliberate effort to 
incorporate a degree of threat that would be apparent only to subjects attending to 
the meaning and implications of the sentence as a whole. 

The 8 by 19 table of log-time scores for the presumedly normal items was subjected 
to an analysis of variance summarized in Table 2. (The analyses of variance were 
performed following McNemar’s model for “analysis of bivariate correlation” (2, pp. 270 ff.).) 
Since the error variance for this experiment does not differ significantly from that of the 
first experiment, the results are probably comparable. Some reduction was achieved in 
variance due to sentence effects other than length in comparison with the variance due to 
sentence length itself, but the reduction is not significant statistically. The effects of 
persons, sentence length, and sentences-not-length were all statistically significant in both 
analyses of variance. 


TABLE 2. ANALYSIS OF VARIANCE FOR EXPERIMENT 3 


Effect                                  Variance    Var/DF     F-ratio 
Grand Mean                              151.700     151.700    4311.34*** 
Persons                                   6.273        .348    [illegible] 
Sentence Length                          12.073      12.073     335.62*** 
  (linear regression) 
Sentences, not Length                     1.176        .196       6.69*** 
  (deviation of means from line) 
Residual (“Error”)                       26            .0309     (1.00) 
Totals 

***Significant beyond the 0.1% level. 
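
The mean squares in Table 2 can be checked against the printed sums by dividing each by its degrees of freedom. The degrees of freedom below are not printed in the table as it survives; they are inferred here from the 8 by 19 layout and should be read as assumptions.

```python
N_PERSONS = 19
N_ITEMS = 8

# Degrees of freedom inferred from the 8 by 19 layout (assumed, not printed values).
DF = {
    "Persons": N_PERSONS - 1,              # 18
    "Sentence Length": 1,                  # one linear regression term
    "Sentences, not Length": N_ITEMS - 2,  # item means about a fitted line
}

# "Variance" column of Table 2 as printed.
SUMS = {"Persons": 6.273, "Sentence Length": 12.073, "Sentences, not Length": 1.176}

mean_squares = {effect: SUMS[effect] / DF[effect] for effect in SUMS}
for effect, ms in mean_squares.items():
    print(f"{effect}: {ms:.3f}")
```

Under these assumed degrees of freedom the quotients agree with the printed entries of .348 and .196 to within rounding.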

In this experiment, if difficulty is plotted as a function of sentence length for 
the eight “normal” items and the five experimental items are compared with the 
resulting regression line, each of the five is found to deviate significantly from the 
line at better than the 1% level, by one-tailed test and in the expected direction of 
greater difficulty. We concluded, therefore, that performance on SA is influenced 
not only by sentence length but also by sentence content, that a major effect of 
content may be interpreted in terms of threat, and that items can be written that 
deliberately incorporate this threat. The threat may act to increase the “effect of 
uncertainty” in the items involved, and thus lengthen the time required for at least 
some subjects to arrange them. 







SUMMARY 

We have described some preliminary steps in the construction of a Sentence 
Arrangement Test, designed for individual administration in conjunction with the 
WAIS. High mean performance on an SA item is associated with the use of a short 
sentence, the avoidance of internal punctuation as a cue to the solution, and the 
avoidance of psychological threat in the content of the solution. High SA perform- 
ance by an individual subject is dependent on general intelligence, a factor of verbal 
information, and a factor of freedom from the “effect of uncertainty”. (The latter is 
regarded as equivalent to “situational anxiety”.) It appears to be possible to 
construct and administer a reliable form of SA that is valid for these factors. 


AN EVALUATION OF RORSCHACH RELIABILITY THROUGH THE 
ALTERNATE-RESPONSE METHOD 
HARRY STEIN 


New Jersey Diagnostic Center, Menlo Park 


PROBLEM 

The traditional method of determining Rorschach reliability has been by the 
split-half method and the usual method of splitting the cards has been by comparing 
the responses on the odd cards to the responses on the even cards. The major 
assumption of the odd vs. even split is that this division produces an equality of 
stimulus values. To the extent that discrepancies exist in the stimulus value of 
odd vs. even cards, a minor assumption is involved, i.e., that all responses to a card 
are determined directly by the card. If a person gives more than one response to a 
card, he must, so to speak, eradicate the first response and return to the card for the 
stimulation of his next response. This assumption is necessary for the following 
reason. If the individual’s second response to any card is determined by his previous 
response rather than the card stimulus, we may see a “run” of responses which will 
exaggerate any discrepancies in stimulus values between the odd and even cards. 
This would result in a lower split-half reliability than could be indicated by an 
alternate response method of determining reliability. 

A concrete illustration of the problem might be helpful. In a test employed 
clinically by the author, the patient is requested to write down as many different 
words as he can in three minutes. An individual writes down the following: “Mother, 
father, sister, brother, table, chair, lamp, ashtray, cat, rat, lion, tiger, India, China, 
Spain, Italy”. (This pattern of responding is not uncommonly given by a 
hospitalized patient population.) Let us suppose each group of 4 words were responses to one 
stimulating idea, or, relating it to the Rorschach, each group of 4 responses was 
given to a Rorschach card. Thus, by the traditional method of splitting the cards, 
we would be comparing ‘mother, father, sister, brother’ with ‘cat, rat, lion, tiger’, and 
‘table, chair, lamp, ashtray’ with ‘India, China, Spain, Italy’. Obviously this makes 
no sense since we would be comparing family members with animals and office 
equipment with countries and the net results would be high reliability for R but zero 
reliability for content scores. However, the alternate response method would compare 
mother to father, sister to brother, cat to rat, lion to tiger, etc. This method does 
make sense out of the subject’s responses, if one employs a method of scoring the 
responses such that the similarity in content is revealed. 
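
The word-list illustration can be made concrete with a short sketch. It assumes that each group of four words stands for one card, that cards 1 and 3 form the odd half and cards 2 and 4 the even half, and that the content categories assigned below (which are labels invented for this example) stand in for content scoring.

```python
from collections import Counter

words = ["mother", "father", "sister", "brother", "table", "chair", "lamp", "ashtray",
         "cat", "rat", "lion", "tiger", "India", "China", "Spain", "Italy"]

# Hypothetical content scoring of each response.
CATEGORY = {}
for group, label in zip(range(0, 16, 4), ["family", "furnishings", "animal", "country"]):
    for word in words[group:group + 4]:
        CATEGORY[word] = label

cards = [words[i:i + 4] for i in range(0, 16, 4)]  # four "cards" of four responses


def content_counts(responses):
    """Number of responses in each content category for one half of the record."""
    return Counter(CATEGORY[word] for word in responses)


# Split-half: all responses to odd cards vs. all responses to even cards.
odd_half, even_half = cards[0] + cards[2], cards[1] + cards[3]

# Alternate response: successive responses go to opposite halves.
half_a, half_b = words[0::2], words[1::2]

print("odd/even :", dict(content_counts(odd_half)), dict(content_counts(even_half)))
print("alternate:", dict(content_counts(half_a)), dict(content_counts(half_b)))
```

Both splits give halves of equal size, so R agrees in either case, but only the alternate response split places every content category in both halves.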

A further illustration might clarify both the problem and the hypothesis tested 
in this study. If one were to find a number of Rorschach records with 20 responses, 
of which 2 were given to each card, would there be a greater similarity between 
responses given to the same card or would the similarity be greater across cards? The 
hypothesis was that the reliabilities obtained for 31 Rorschach scores by 
the alternate response method would be greater than those obtained by the split-half 
method. 


METHOD 

Since it would be difficult, time consuming and perhaps even impossible to find 
a sufficient number of 20 response records, each of which has 2 responses per card, 
the author used a sample of 53 individuals with varying degrees of physical 
disability, described elsewhere, where the average number of responses to the 
Rorschach was 19.6. The Klopfer and Kelley method of administration was employed 
with some minor modifications. 

Because the literature has indicated a “succession” effect in the pattern of 
W, D, d, Dd, a method was devised for equalizing this factor across the split, 
consisting of grouping the cards with respect to the number of responses given. Thus, 
all cards with an even number of responses were alternately started on different sides 
of the split starting with A, while those cards with an odd number of responses were 
alternated starting with B. An example of the division of responses is given in 
Figure 1. 


FIGURE 1. METHOD AND ILLUSTRATION OF DIVIDING A 20 RESPONSE RORSCHACH RECORD TO 
CALCULATE ALTERNATE RESPONSE RELIABILITY¹ 

[Figure not reproduced. Recoverable labels: Cards; No. Responses Per Card; Starting Position.] 

¹What the author was attempting to avoid with this method of dividing the responses was the 
case of an even number of responses to each card all taking the form of W, D, d, Dd. In this case, a 
simple alternation would place all the W and d responses on one side of the split and the D and Dd 
responses on the other side of the split. 
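
One reading of this division rule is sketched below. The interpretation is an assumption: within a card, successive responses go to opposite sides; among cards with an even number of responses the starting side alternates from card to card beginning with A; among cards with an odd number of responses it alternates beginning with B. The function name and the example record are hypothetical.

```python
def assign_halves(responses_per_card):
    """Return, for each card, the side ('A' or 'B') given to each successive response."""
    next_start = {"even": "A", "odd": "B"}
    assignment = []
    for count in responses_per_card:
        if count == 0:
            assignment.append([])  # a rejected card contributes nothing to either side
            continue
        parity = "even" if count % 2 == 0 else "odd"
        start = next_start[parity]
        other = "B" if start == "A" else "A"
        assignment.append([start if i % 2 == 0 else other for i in range(count)])
        next_start[parity] = other  # the next card of this parity starts on the other side
    return assignment


# Hypothetical 20-response record with two responses to each of the ten cards.
record = [2] * 10
sides = assign_halves(record)
flat = [side for card in sides for side in card]
print(sides[:3], flat.count("A"), flat.count("B"))
```

With two responses per card this reading sends the first response of successive cards to A, B, A, B and so on, so that first responses (often W) are not all collected on one side of the split.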

RESULTS AND DISCUSSION 


A comparison of the reliability correlations obtained by the split-half and 
alternate response methods is given in Table 1. Instead of an improvement in the 
reliability correlations, there was a non-significant decrease from the average 
reliability of .246 for the split-half method to .216 for the alternate response method. 
The hypothesis that the reliabilities of Rorschach scores would be improved by the 
alternate response method must be rejected. 


TABLE 1. COMPARISON OF ALTERNATE RESPONSE AND SPLIT-HALF RELIABILITIES 
WITH USUAL METHOD OF RORSCHACH ADMINISTRATION.* 

*The reliabilities given below have not been corrected by the Spearman-Brown 
Prophecy Formula. 
**The averages were determined through z transformations. 


Score Per Cent            Split-Half      Alternate Response 
                          Reliability     Reliability 

W%                        .459            .628 
D%                        .286            .507 
d%                        .555            .244 
Dd%                       .289            .348 
S%                        .023            .221 
Location Average**        .334            .410 
M%                        .058            .179 
FM%                       .283            .257 
m%                        .273            .166 
F%                        .354            .527 
k%                        .102            .109 
Fc%                       .247            .079 
cF, c%                    .214            .021 
FK%                       .322            .104 
K, KF%                    .108            .151 
C'%                       .286            .518 
FC%                       .318            .295 
C, CF%                    .230            .193 
Determinant Average**     .218            .191 
[illegible]%              .012            .104 
[illegible]               .472            .439 
[illegible]               .058            .041 
[illegible]               .075            .079 
[illegible]               .094            .047 
[illegible]               .546            .685 
Plant%                    .156            .253 
Geo%                      .484            .247 
Art%                      .445            [illegible] 
Arch%                     .433            .110 
Cloud%                    .535            −.054 
Blood%                    .000            .000 
Fire%                     .116            .116 
Emb%                      .058            .062 
Content Average**         .240            .161 
Grand Average**           .246            .216 

An examination of the table and cogitation about the development of the 
hypothesis led to the realization that, in the administration of the Rorschach, the 
author had observed ‘‘runs’’ or sequence effects for such scores as Dd and F. Certain 
obsessive persons on some cards would give a long run of Dd’s whereas others would 
give long runs of F responses. It was this experience which had led through various 
stages to the formulation of the hypothesis. It would appear that a resolution of the 
clinical observation of “runs” and its relationship to reliability is more complex than 
indicated by the slight reduction in average reliability from the split-half to the 
alternate response method. The data in Table 1 suggest that, for certain types of 
responses, there is a facilitating effect in that the likelihood of a similar response is 
enhanced, whereas for other types of responses, the emission of a similar response is 
inhibited. 

Let us examine the data for inhibition. If a person gives a response indicative of 
anxiety, this disquieting emotional state would predispose him to return to the card 
and search out a response as different as possible from the one he had given. Examining 
the group of shading responses (k%, Fc%, cF,c%, FK%, and K,KF%), a 
sharp drop in reliability occurs from the split-half method of determination to the 
alternate response method. The average for the 5 shading scores by the split-half 
method is .161, whereas it is only .009 by the alternate response method. This inter- 
pretation is supported by the sharp drop in the reliability of Cloud responses for the 
alternate response method (from .535 to -.054). On the other hand, there is no 
support in the Anatomy responses, which remain the same for both methods of 
determining reliability. 

There is a general decline in reliability for all content scores from an average of 
.240 by the split-half method to .161 by the alternate response method. It has often 
been observed by the author, and others, that patients frequently make self-critical 
comments about their inability to diversify their content on the Rorschach. Such 
comments as “All I can see are animals” and “What, another bat?” indicate a 
general desire to vary their content. The precipitous drop in reliability from the 
split-half to alternate response method for such content categories as Geography, 
Architecture and Cloud indicates that one such response tends to inhibit a similar 
succeeding response. 

The average reliability of Location scores rises from .334 as measured by the 
split-half method to .410 when determined by the alternate response method. This 
would suggest that the utilization of a W, D, etc., type of response, facilitates an 
identical type of response. Similarly, when an individual gives a Form response, he 
tends more strongly to give an identical response immediately following than he is 
likely to give a Form response to another blot as a new stimulus situation. 

Several factors must be kept in mind regarding the above analysis of the data. 
Since the initial hypothesis was not supported, the interpretations of the results are 
in the nature of new hypotheses requiring experimental confirmation. Also, the 
specific procedure employed in dividing the responses so as to avoid the “succession” 
effect may have affected the reliabilities obtained with the alternate response 
method. Ignoring the “succession” effect, which was minimal in the records, and 
taking a simple alternation may be a better procedure to employ. Although the 
reliabilities of individual scores are quite variable“, the data in Table 1 strongly 
suggest a facilitating “runs” effect for certain of the Rorschach scores, including the 
Location scores and F%. An inhibitory effect was observed in the Content and 
Shading scores. Should these observations be confirmed, we can accept these 
Location and F% alternate response reliabilities as being “truer” measures of reliability 
than those obtained by the split-half method. 

For purposes of experimental design it was necessary to assume that Rorschach 
responses, other than the first one, are either determined by the stimulus of the card 
or the preceding response. The response, of course, may be determined by both in 
varying degrees at different times during the test. For this condition, which is 
probably the most accurate description of emission of Rorschach responses, neither 
the split-half nor alternate response method of determining reliability will give the 
full measure of Rorschach reliability. 


SUMMARY 


On the basis of observations of ‘‘runs’’ of certain Rorschach scores, it was 
hypothesized that reliability of Rorschach scores would be greater when measured 
by the alternate response method than by the usual split-half method. This 
hypothesis was not supported. However, the group of Location scores and F% yielded 
higher reliabilities by the alternate response method. An inhibitory effect on the 
successive emission of Shading and Content scores was seen in the lower 
reliabilities obtained by the alternate response method than by the split-half method. 
If these observations are confirmed in additional studies they would suggest greater 
reliability for Rorschach scores than is presently presumed. 

HOMOSEXUALITY AND SOME MMPI MEASURES OF 
MASCULINITY-FEMININITY¹ 
BERNARD S. AARONSON AND HOWARD R. GRUMPELT 


N.J. Bureau of Research in Neurology and Psychiatry, Box 1000, Princeton 
8 South Wells Street, Kingston, Pennsylvania 


PROBLEM 

A number of recent studies“: ® *) have suggested that the personality differ- 
ences between men and women comprise more than a single dimension. The per- 
sistent stereotype that homosexuals are in personality more like individuals of the 
opposite sex seems as a result to require re-evaluation. In addition, the varying 
proportion of homosexual outlet noted by Kinsey, et al.“*: ®), and the delineation by 
Hathaway? of a number of different kinds of homosexuals, raise further question 
as to the validity of this concept. If the stereotype is true, then personality measures 
which differentiate men from women should also differentiate homosexuals from 
heterosexuals, even though the correlation between these measures is low. 


PROCEDURES 
To evaluate this, a sample of profiles obtained on the Minnesota Multiphasic 
Personality Inventory (MMPI) from 25 males with histories of exclusive or nearly 
exclusive homosexuality was gathered. Fourteen came from the New Jersey State 
Hospital at Trenton, seven from a VA hospital in Illinois, and four from a state 
hospital in Minnesota.² These were matched with 25 male patients at the New Jersey 
State Hospital with no history of any homosexual activity or preoccupation at all. 
Matching was on the basis of age and the type of patient population from which the 
individual cases were drawn. That is, those from a criminal unit were matched with 
patients from a criminal unit, those from an acute service with those from an acute 
service. The mean age of the homosexual group was 28.0, and of the heterosexual 
group, 28.3, which is not significantly different. 


¹The basic data were gathered while both authors were on the staff of the New Jersey State 
Hospital at Trenton. 

²The writers are indebted to Dr. Robert E. Jones, Donald R. Peterson, and Mr. Roland M. Peek 
for their assistance in gathering these data. 

The T score on the MF scale (MFs), the rank of this scale in the profile (MFr) 
consisting of the ten clinical scales set forth in the test manual, and the score on 
the masculinity-femininity index (MFi) based on a relationship among the scales 
Hy, Pa, Hs, and Pr“, were determined for each profile. These measures were 
intercorrelated to determine their relationship in each group. The extent to which each 
measure segregated homosexuals was tested by combining the two groups, 
eliminating those cases that fell at the median, and comparing the resultant distributions 
by chi square. The extent to which high or low scores on any of the measures were 
associated with homosexuality and heterosexuality was determined by comparing 
the mean scores on these measures in the two groups by t-test. 
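
The median-split comparison described above can be sketched as follows. This is a minimal illustration with invented scores, and it assumes an uncorrected 2 × 2 chi square; the article does not say whether a continuity correction was applied. The function name and both score lists are hypothetical.

```python
from statistics import median


def median_split_chi_square(group_a, group_b):
    """Pool two groups, drop cases at the pooled median, and compute a 2 x 2 chi square.

    Returns ((a_above, a_below, b_above, b_below), chi_square).
    """
    cut = median(list(group_a) + list(group_b))
    a_above = sum(x > cut for x in group_a)
    a_below = sum(x < cut for x in group_a)
    b_above = sum(x > cut for x in group_b)
    b_below = sum(x < cut for x in group_b)
    n = a_above + a_below + b_above + b_below
    numerator = n * (a_above * b_below - a_below * b_above) ** 2
    denominator = (
        (a_above + a_below) * (b_above + b_below)
        * (a_above + b_above) * (a_below + b_below)
    )
    return (a_above, a_below, b_above, b_below), numerator / denominator


# Invented T scores for two groups of six; the pooled median is 61.
scores_a = [70, 68, 65, 72, 61, 66]
scores_b = [50, 55, 61, 48, 58, 52]
table, chi_square = median_split_chi_square(scores_a, scores_b)
```

The two cases that fall exactly at the median are dropped, so the chi square here rests on ten cases rather than twelve.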


RESULTS 

In the homosexual group, MFs correlates -.60 with MFr, significant beyond the 
.001 level, and -.28 with MFi. In the heterosexual group, MFs correlates -.58 with 
MFr, significant beyond the .001 level, and .25 with MFi. MFr correlates .31 with 
MFi among the homosexuals, and .19 among the heterosexuals. Only MFs and MFr 
show any appreciable relationship, but it should be noted that these are two different 
ways of scoring the same scale. The negative sign of these correlations simply 
reflects the ordering of T scores from low to high as opposed to the ranking procedure 
in which a rank of one is assigned to the highest score. 
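
The sign reversal can be seen in a small sketch. The profiles here are invented: nine fixed scale scores plus an MF score that varies from person to person. The helper names are hypothetical, and nothing here reproduces the study's data.

```python
import math


def pearson(x, y):
    """Pearson product moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def rank_in_profile(profile, index):
    """Rank of the scale at `index` within its profile; rank 1 is the highest score."""
    return 1 + sum(score > profile[index] for score in profile)


# Invented data: nine other scale scores held fixed, MF T score varied.
other_scales = [50, 52, 54, 56, 58, 60, 62, 64, 66]
mf_scores = [45, 55, 61, 65, 70]
profiles = [other_scales + [mf] for mf in mf_scores]
mf_ranks = [rank_in_profile(p, len(p) - 1) for p in profiles]
r = pearson(mf_scores, mf_ranks)
```

Because a higher T score earns a numerically smaller rank, the correlation between score and rank is strongly negative even though both describe the same elevation.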

The chi square values obtained when the ability of each measure to differentiate 
homosexuals from heterosexuals was compared were 14.54 for MFi, 17.94 for MFs, 
and 18.42 for MFr. All of these are significant beyond the .001 level. All of the 
measures seem to differentiate homosexuals from heterosexuals. 

The optimal cutting score for MFi in differentiating these groups is at an index 
value of 46. Only nine homosexuals fall at or above this, and nine heterosexuals 
below. The cutting score for MFs is a T score of 60. Only four homosexuals fall 
below this, and six heterosexuals at or above. The cutting score for MFr is a rank of 6. 
Only one homosexual falls below this and eight heterosexuals above. The optimal 
combination of these measures as indicators when applied to the present sample is 
to diagnose homosexuality when any two of the three deviate in a homosexual 
direction with respect to their cutting scores. This results in misclassifying one 
homosexual and four heterosexuals. A criterion of three misclassifies twelve 
homosexuals and one heterosexual, and a criterion of one, one homosexual and eighteen 
heterosexuals. 

The means and standard deviations on the three measures in both groups are 
shown in Table 1. While the differences on MFi are in a predicted direction, the 

TABLE 1. MEAN SCORES AND STANDARD DEVIATIONS ON THE MF INDEX, MF 
SCALE SCORE, AND MF SCORE RANK IN THE HOMOSEXUAL AND HETEROSEXUAL 


Homosexual Heterosexual 
Measure Mean S.D. 


4 14.1 
4 10.1 
3 1.8 


MFI 45 
MFS 0 
MFR 3 


variance in both groups seems large and the t-ratio is 1.55, which is not significant. 
The t value obtained for the MFs means is 5.24, significant beyond the .001 level. 
MFr showed much more variability among heterosexuals than homosexuals, as 
manifested by an F value of 2.06, significant beyond the .05 level. The t-ratio, calculated 
with a formula for samples of unequal variance, was 5.78, significant beyond the 
.001 level. Although all three measures differentiate homosexuals from heterosexuals, 
and can be used in concert to make such differentiations, marked differences in score 
associated with membership in these groups appear only with MFs and MFr. 
Inspection of the data on MFi shows that a small subset of homosexuals seems to get 
high, or very “masculine” scores on this index although the larger part of this group 
falls within the “feminine” range. 
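
The F value and the unequal-variance t-ratio mentioned above can be illustrated with a short sketch. The article does not name its formula, so the separate-variance statistic used here (the one that appears in Welch's test) is an assumption, and the two samples are invented.

```python
import math
from statistics import mean, variance


def variance_ratio(a, b):
    """F ratio: the larger sample variance divided by the smaller."""
    var_a, var_b = variance(a), variance(b)
    return max(var_a, var_b) / min(var_a, var_b)


def separate_variance_t(a, b):
    """t statistic that keeps the two sample variances separate instead of pooling them."""
    standard_error = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


# Invented samples with clearly unequal spread.
group_1 = [1, 2, 3, 4, 5]
group_2 = [2, 6, 10, 14, 18]
f_value = variance_ratio(group_1, group_2)
t_value = separate_variance_t(group_1, group_2)
```

When the F ratio shows the variances to differ, pooling them would misstate the standard error of the difference, which is why a separate-variance formula is preferred in that case.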


DISCUSSION 


These data suggest that while male homosexuals in general seem to be more 
“feminine”, a subset seems to exist who show more masculine behaviors. This is 
in accord with the earlier findings of Terman and Miles ‘*’, whose scale was derived 
on the basis of essentially the same comparison as MFi. The measures which 
differentiate homosexuals from heterosexuals best are originally derived from a 
comparison of homosexuals with heterosexuals. The overall data raise the question of 
whether more differences exist between homosexual and heterosexual males than 
might be anticipated from the action of a dimension of masculinity-femininity con- 
sidered alone. While a personality orientation similar to that of the opposite sex 
seems conducive to the expression of homosexual behavior, it does not seem of 
equal relevance in all instances. The relationship may be more correlative than 
causal. Conversely, since these groups do behave differently on measures related to 
masculine-feminine personality differences, the non-unitary nature of these differ- 
ences is again suggested. 

The cutting scores for the several measures as given here probably require re- 
vision in normal clinical use. The sample employed was carefully selected and the 
problem is different from that of selecting homosexuals from a random clinical population. 
Similarly, the 90% correct classification with a criterion of any two of the three 
measures pointing to homosexuality is undoubtedly contaminated by having been 
applied back to the sample from which the cutting scores were derived. Its sig- 
nificance is in showing that taking a combination of any two of these measures in- 
dicating homosexuality as the basis for diagnostic judgment provides the most 
accuracy. 


SUMMARY 


Application of three MMPI measures of masculinity-femininity to samples of 25 
homosexual and 25 heterosexual male patients shows that while all three differen- 
tiate homosexuals from heterosexuals, a subset of homosexuals exists who seem very 
masculine in orientation. While feminine orientation seems related to homosexual 
behavior, it does not seem equally relevant in all instances. These data suggest the 
heterogeneity of any postulated dimensions of masculinity-femininity. 

EYSENCK’S “STRATIFICATION” THEORY AND THE QUESTIONNAIRE 
METHOD OF MEASURING PERSONALITY¹ 
C. M. FRANKS, E. A. HOLDEN AND M. PHILLIPS 


New Jersey Neuro-Psychiatric Institute, Princeton 


INTRODUCTION 

There is considerable disagreement concerning the equivalence of the informa- 
tion obtained from objective behavior ratings or check lists, direct observation and 
self-rating questionnaires. For example, Cattell and his associates“? assume that an 
almost perfect correspondence between behavior ratings and self-rating question- 
naires has been established, whereas Becker“ arrives at quite a different conclusion 
after a critical review of the evidence. The similarities and differences between the 
data obtained from these kinds of personality measuring devices may well be de- 
pendent both upon the general nature of the personality factor it is hoped to measure 
and the population being studied. 

It is possible that both techniques yield meaningful information about a subject 
and that the data obtained from either can be consistently integrated into the same 
personality model without excessive modification of the theory in question. There 
is clearly a need for further investigation of this problem and the parameters pertin- 
ent to the discrepancies between self- and behavior ratings in various populations. 
Meanwhile, the absence of such knowledge can present a considerable hazard to 
those who need to have an objective measure of observable personality behavior, 
especially if their concern is with the experimental investigation of theories which 
attempt to relate observable personality differences to various attributes of the 
central nervous system. Of such theories, that of Eysenck is undoubtedly the most 
clearly and provocatively formulated and it is, therefore, hardly surprising that it 
has received the most attention. 

Eysenck®? integrates constitutional factors, learning theory, and personality 
theory into a stratified system in which there are four main levels, all conceived of as 
being causally related to each other in hierarchical order. Level 1, the most funda- 
mental, is physiologically determined; level 2 is that of observable experimental 
laboratory phenomena such as eyeblink conditioning or figural after-effects; level 3 
is that of objectively observable primary personality traits such as sociability and 
activity; level 4 is that of observable attitudinal phenomena such as ethnocentricism 
and hedonism. The direct concern here is not with the merits or demerits of this 
theory per se but with one particular methodological aspect associated with its in- 
vestigation. Therefore, while it is fully recognized that the above brief description 
does scant justice to Eysenck’s carefully developed arguments, it is not necessary 
here to elaborate the theory in any great detail. It is sufficient to reaffirm that the 
fundamental level, governing a wide variety of behavior at different levels, refers to 
some constitutional, but at present largely hypothetical, property of the nervous 
system of the individual. 

As Eysenck®? points out, causally the progression is from level 1 to level 4. 
Historically, however, levels 3 and 4 were investigated first and independently, and 
then linked up experimentally with level 2 and hence, by inference, with level 1. 
The delineation of level 3 began with the description of two orthogonal personality 
dimensions, namely introversion-extraversion and neuroticism, these concepts being 
themselves defined in terms of observed relationships between various primary 
traits such as sociability, excitability, and impulsiveness. These dimensions are now 
conveniently measured by self-rating questionnaires such as the Maudsley Person- 
ality Inventory (MPI), although originally a variety of other measures were used. 


¹The authors wish to acknowledge the help of Miss Elinor Phillips in collecting and scoring part 
of the basic data. They are deeply indebted to Dr. David R. Sanders for his invaluable guidance in the 
treatment of the data and the provision of computational facilities. The project was supported jointly 
by the National Institute of Mental Health under Research Grant M2720, the Society for the In- 
vestigation of Human Ecology and the New Jersey Neuro-Psychiatric Institute. 

In terms of Eysenck’s “stratification” theory, level 2 phenomena, themselves 
dependent upon the more fundamental processes of level 1, in their turn partly de- 
termine level 3 phenomena. For example, the hypothetical central processes of 
excitation and inhibition at level 1 are supposed to determine in part whether a sub- 
ject is good or poor at forming conditioned responses in the laboratory. And a person 
who is good at forming laboratory conditioned responses is, therefore, presumed to 
be good at forming conditioned responses outside the laboratory—he is presumably 
responsive to the stimuli in his environment and to the social climate, sensitive to 
new people and situations and cautiously discriminating in his observable behavior. 
Similarly, the poor laboratory conditioner is presumed to be insensitive to new 
situations and the social climate around him and indiscriminating in his observable 
behavior. These characteristics may not unreasonably be described as the observable 
aspects of introversion and extraversion respectively. There are, of course, many 
non-observable attributes of introversion and extraversion which can also be explained in 
terms of conditioning but, since it is not possible for external observers to rate such 
characteristics, they are of no direct relevance to the present discussion. If, 
therefore, only the externally observable characteristics of extraversion-introversion are 
considered, testable deductions can be made, deductions such as “Introverts form 
conditioned eyeblink responses more quickly than extraverts.” 


PROBLEM 

Testing such a prediction presents many difficulties of a methodological, tech- 
nical and theoretical nature which have been discussed by the senior author else- 
where‘**: 3. 4). Even under idealized circumstances, such as the existence of a per- 
fect measure of objective observable behavior, one might still not expect to find a 
perfect relationship between levels 2 and 3 as manifest in laboratory conditioning 
and observed behavior respectively. Thus, in certain circumstances, the good 
laboratory conditioner might learn to form extraverted behavior patterns if the need were 
great enough, whereas the poor laboratory conditioner would find much greater 
difficulty in learning introverted patterns of behavior and hence would be more likely 
to display the more general predicted relationship between conditioning and ob- 
servable behavior. Thus, the environmental situation, or the conditioning the sub- 
ject actually receives, may be at least as important as his conditionability. This is 
very apparent in the case of the delinquent who could be an antisocial delinquent 
because he is a good conditioner who has learned the wrong rules, or an asocial de- 
linquent because he is a poor conditioner who cannot learn any rules, good or bad. 

It is thus apparent that, even with the assumption that a perfect measure of 
observable behavior is available, the investigation of predicted relationships between 
laboratory conditioning and observable behavior presents numerous difficulties. If, 
however, the device used to measure this observable behavior is itself questionable, 
then the hazards involved in testing even such a simple proposition as “Introverts 
form conditioned eyeblink responses more quickly than extraverts” become 
enormously increased. 

Both Eysenck and his colleagues, and those who wish to evaluate his theory 
independently are, for the most part, currently using the MPI (earlier studies, 
including those of Eysenck and the present authors, have used the scales of Guilford 
or Cattell or parts of the MMPI or the Cornell Index, but essentially the same argu- 
ment applies to these questionnaires if used to measure objective aspects of be- 
havior). The MPI consists of two orthogonal self-rating scales, one measuring 
extraversion-introversion (E) and the other measuring neuroticism (N). The E scale 
may best be described as a measure of social tendency °’. The items were devised by 
elaborate procedures involving factor and item analyses of other personality in- 
ventories, particularly the Guilford inventory of factors STDCR and the Maudsley 
Medical Questionnaire. The E and N scales have high “construct validity”, that is, 
the items making up each scale are highly correlated with the factors they purport to 
measure and insignificantly correlated with the other factors. 

Some of the items in this inventory (e.g., “Are you inclined to keep in the 
background on social occasions?”) could be rated by external observers; others (e.g., “Are 
your daydreams frequently about things that can never come true?”) cannot be 
rated by external observers. For those self-rating items that can be fairly readily 
translated into a behavior rating form, a high agreement between the data obtained 
from the two forms might be expected. If Eysenck’s theory of the causal relation- 
ship between laboratory conditioning and observed social behavior is pertinent then, 
as long as the methods used are equally objective and reliable, it should pertain 
irrespective of the method used to measure this observed social behavior. A com- 
pletely objective method is presumably one in which, among other attributes, the 
subject is unaware of what is being measured and unable to influence the score in any 
way. In these respects, ratings by others are considerably more objective than 
self-ratings. Conversely, it might be argued that, for obvious reasons, the self- 
rater is sometimes in a better position to provide accurate information about himself 
than an external observer. Thus, quite apart from considerations such as the notor- 
ious unreliability of Yes/No questionnaires and complicating matters such as the 
presence of response set, acquiescence and social desirability factors, there are many 
reasons why the two kinds of ratings might not agree. Irrespective of the reasons 
why, if these two methods of measuring identical items of social behavior yield scores 
which differ appreciably, then serious difficulties arise in the testing of even the 
simplest deductions involving level 3.² Unless he is prepared to establish different 
predictions for each beforehand, the investigator is faced with the problem of which 
set of data to accept, those obtained from self-ratings or those obtained from ratings 
by others. It is suspected that some of the equivocation arising out of the attempts 
to investigate Eysenck’s theory stems from such sources; in particular, the reliance 
on self-rating personality questionnaires as adequate measures of objective be- 
havior. 


METHOD AND PROCEDURE 


The Questionnaire. A 40-item questionnaire was prepared in two forms, one suitable 
for self-rating and one suitable for rating by others. Each questionnaire, differing 
only in a translation from the second to the third person singular, consisted of 16 E 
items, 14 N items and 10 buffer items. These 30 E and N items were selected from 
the 48 items of the MPI solely because they were directly amenable to external 
rating procedures. Unlike the original MPI, in which a ? response is permitted, the 
present questionnaires were of the Yes/No variety, one point being awarded for 
each keyed response. 
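
The scoring rule (one point for each keyed response, with buffer items unscored) can be sketched as below. The item numbers and the scoring key are hypothetical, since the keyed direction of each item is not given here.

```python
def score_questionnaire(answers, key):
    """Score a Yes/No questionnaire.

    answers: {item_number: "Yes" or "No"}
    key:     {item_number: (scale, keyed_answer)}; buffer items are simply
             absent from the key and therefore never scored.
    """
    totals = {"E": 0, "N": 0}
    for item, (scale, keyed_answer) in key.items():
        if answers.get(item) == keyed_answer:
            totals[scale] += 1  # one point for each keyed response
    return totals


# Hypothetical key covering four scored items; item 4 is treated as a buffer item.
key = {1: ("E", "Yes"), 2: ("N", "Yes"), 3: ("E", "No"), 5: ("N", "No")}
answers = {1: "Yes", 2: "No", 3: "No", 4: "Yes", 5: "No"}
scores = score_questionnaire(answers, key)
```

The same function serves both forms of the questionnaire, since the self-rating and other-rating versions differ only in grammatical person.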


Subjects. One group consisted of 9 young, male alcoholics, all resident members of 
a 40 bed clinic who had been meeting regularly as a small unit for intensive group 
psychotherapy for at least three weeks. None of the Ss were illiterate, none were 
demonstrably psychotic or brain damaged. All were of adequate intelligence and 
educational background. 

The other group consisted of 9 apparently healthy male students, all residents 
together in the same fraternity house for at least two years. 


Procedure. Each group was tested separately, all S’s being given first the self-rating 
form of the questionnaire. Upon completion with no time limit, each S was then 
asked to complete, one at a time and in counterbalanced order, one rating on each 
of his 8 comrades. No S knew the precise purpose of the investigation. 
Contamination was avoided by making the one testing session brief and closely proctored, and 
by removing each completed questionnaire before S was given the next one. Most 
S’s completed all 9 questionnaires in about 2 hours. Candid responses were 
encouraged by stressing anonymity and the research nature of the project. All S’s 
were volunteers. For other purposes, not of concern here, a brief set of sociometric 
questions was appended to each questionnaire. In these questions S was asked to 
establish a preference rating, or selection from among his colleagues, for future 
shared experiences in a variety of appropriate situations. 


²This lack of agreement would, of course, present no problem in those personality theories which 
utilize questionnaires to measure self-image or the so-called psychodynamic aspects of personality 
functioning, since both self and “other” ratings could be argued to be measuring different, but 
perhaps equally valid, aspects of the same person. 

Data Analysis. The E and N raw scores for each group were arranged in 9 × 9 score 
matrices with one row for each S judged and one column for each S acting as a judge. 
The self-ratings were transposed from their original diagonal cell locations to their 
respective row positions to form a tenth column which could then be treated as a 
tenth set of judgments. The resulting empty cell in each column array was given the 
mean of all other judgments in the row identified by the empty cell and a Pearson 
product moment intercorrelation matrix obtained for each of the four sets of data 
(Table 1). Each matrix was then factor analyzed by the method of principal 
components and found to contain just one factor per matrix. The covariance loadings 
on the respective factors are presented in Table 2. 
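
The steps of this analysis can be sketched as follows. Several points are assumptions rather than the authors' procedure: the empty diagonal cell is filled with the mean of the eight peer ratings in its row, the first principal component is found by simple power iteration, and the ratings themselves are simulated. All function names are hypothetical.

```python
import math
import random


def add_self_column(ratings):
    """ratings[i][j] is the score judge j gave subject i; ratings[i][i] is the self-rating.

    Moves each self-rating to a final column and fills the vacated diagonal
    cell with the mean of the peer ratings in that row (assumed reading).
    """
    n = len(ratings)
    matrix = []
    for i, row in enumerate(ratings):
        peers = [row[j] for j in range(n) if j != i]
        fill = sum(peers) / len(peers)
        matrix.append([fill if j == i else row[j] for j in range(n)] + [row[i]])
    return matrix


def pearson(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(matrix):
    """Intercorrelations among the columns (the ten sets of judgments)."""
    columns = list(zip(*matrix))
    return [[pearson(a, b) for b in columns] for a in columns]


def first_component(corr, iterations=200):
    """Largest eigenvalue and loadings of the first principal component (power iteration)."""
    size = len(corr)
    vector = [1.0] * size
    for _ in range(iterations):
        product = [sum(corr[i][j] * vector[j] for j in range(size)) for i in range(size)]
        norm = math.sqrt(sum(p * p for p in product))
        vector = [p / norm for p in product]
    product = [sum(corr[i][j] * vector[j] for j in range(size)) for i in range(size)]
    eigenvalue = sum(v * p for v, p in zip(vector, product))
    return eigenvalue, [v * math.sqrt(eigenvalue) for v in vector]


# Simulated E scores (0 to 16): a notional level per subject plus judge error.
rng = random.Random(0)
levels = [3, 5, 7, 8, 9, 10, 12, 13, 15]
ratings = [[min(16, max(0, t + rng.randint(-2, 2))) for _ in range(9)] for t in levels]

matrix = add_self_column(ratings)          # 9 rows, 10 columns
corr = correlation_matrix(matrix)          # 10 x 10
eigenvalue, loadings = first_component(corr)
```

The last entry of `loadings` belongs to the self-rating column, so comparing it with the other nine shows how well self-ratings represent the common factor in a given group.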

RESULTS AND DISCUSSION 


As might be expected, each set of data yields one common factor suggesting that, 
whatever it is the raters are agreeing about, there is only one school of thought 
underlying this agreement. In the normal group the “self-rating” is indeed the best 
single estimate of both the E and the N factor (if this is, not unreasonably, assumed 
to be what the unitary factors designate). In the patient group the “self-rating” 
provides the worst estimates of the respective factors. Individually, many of the other 
raters do better; collectively, they provide a much better estimate of the factor. 
Whereas in the normal group the “other” judges tend to be better at judging E, in 
the patient group they tend to be better at judging N. The correlations between the 
ability to judge others on E and the ability to judge them on N are greater for the 
normal group than for the patient group, i.e., the patient group as judges are less 
consistent from scale to scale than are the normal group. 

From these observations, and in particular from a plot of the data presented in 
Table 2, it may be seen that, if the MPI scales (as modified here) are used to obtain 
estimates of a person’s self-rating E and N scores, then the agreement of these self- 
rating scores with “other judges” scores is likely to be high for normal subjects but 
low in the case of certain patient populations. Thus, if the investigation of Eysenck’s 
hypothesis is confined to carefully selected normal subjects using carefully selected 
“behavioral” items, then self-ratings are apparently as good as any careful judgment 
by others. However, when patient or neurotic groups are used (as in many in- 
stances) then a problem of questionnaire validity may arise since there may be no 
agreement between self-ratings and the judgments of others. 

It might be argued that, since the patient group had known each other for only 
three weeks, any discrepancy between a self-rating and a rating by another could 
be attributed to a lack of knowledge by the other of the person being rated; 
therefore, one might expect the self-rating to provide the more accurate information. Such 
an argument is not tenable since, despite a group acquaintanceship of only 3 weeks, 
the “other” raters agree among themselves in their independent ratings of any one 
individual. (The readily testable assumption here is that if the above procedure 
were adopted with a group of strangers no similar common factor would be found.) 

The present findings may explain the failure of Sigal, Star and Franks“? to 
differentiate hysterics, dysthymics and normals in a satisfactory manner by means 
of the MPI used as a self-rating device. It seems likely that, despite its face validity, 
the MPI is not a good measuring instrument of objective behavior when used in the 
advocated self-rating manner and with mentally or emotionally abnormal S’s. In 
the light of the good agreement among independent judges a very real problem of 
self-rating questionnaire validity and utility arises when such questionnaires are 
used to provide objective measures of behavioral characteristics. (It should be 
observed that this criticism of self-rating questionnaires need not necessarily apply 
either to questionnaires used clinically as projective devices to obtain self-image 
concepts, or to self-rating questionnaires of the MMPI variety in which the 
criterion is simply their demonstrated ability to differentiate among various diagnostic 
groups.) It is possible that many of the equivocal or negative findings (and 
perhaps even some of the positive results) obtained in investigating predictions 
relating Eysenck’s levels 2 and 3 within disturbed populations may be in part 
attributable to the fact that all these workers used self-rating questionnaires to measure 
E and N. In such cases, perhaps the judgments of external raters, or even actual 
situational observations, might provide better measures of the E and N factors. 

In conclusion, the clinical and research possibilities of utilizing a combination 
of self- and “other” rating techniques should be indicated. Further research is re- 
quired to relate variables such as pathology, group homogeneity and length and 
type of acquaintanceships to observed discrepancies between the two kinds of 
ratings for various dimensions of personality. It is also possible that the differences 
in the two kinds of scores might lead to fruitful hypotheses about possible areas of 
disturbance or conflict. The differences in reliabilities and stability of the two scores 
might also be of value in this respect. Another use might be in the area of
psychopharmacology, since there is already some evidence to suggest that certain drugs
tend to increase both E type scores as measured by self-ratings and also the
response set to reply “Yes” irrespective of the question being asked. Thus, such a
drug might be expected to modify items scored in the “Yes” direction more when 
behavior is self-rated than when this same behavior is rated by others. 


SUMMARY 

By a factor analytic method, it was found that self-rating measures of intro- 
version-extraversion and of neuroticism (using a modified version of the Maudsley 
Personality Inventory) agree closely with measures obtained by external raters, the 
self-rating technique providing the best measures of the respective factors. In a 
psychiatric population no such agreement was found, the external raters agreeing 
among themselves but not with the estimates obtained by self-rating. From these 
observations it is suggested that there may be serious objections to the use of self- 
rating questionnaires for the measurement of observable personality attributes of 
behavior in abnormal populations. This possibility is of particular relevance to 
those concerned with the investigation of predictions derived from Eysenck’s 
“stratification” theory of behavior. 
PROBLEM

A pilot study was conducted at New York State Psychiatric Institute to test the 
feasibility of sample survey methods for the evaluation of a mental hospital program. 
It involved an analysis of the hospital experience of patients as well as an examina- 
tion of the disposition of hospital resources and personnel directed toward the re- 
covery of patients. The focus of interest has been the individual patient— how his 
time is spent in various activities, how many and what kinds of treatments he re- 
ceives, how much time he spends in the hospital, what privileges he has, how severe 
his illness is as reflected in his behavior on the ward, etc. 


1This research was supported in part by a Public Health Service Research Grant (M-3546) from
the National Institute of Mental Health. 
Previous studies of psychiatric facilities have dealt with such questions as the
preferability of a therapeutically oriented hospital setting as compared with a
custodially oriented one, the social structure of the hospital itself, or characteristic
features significant for the effective functioning of the hospital as a treatment
facility. But psychiatric studies, for the most part, have not been concerned
with the representativeness of their samples, nor have they taken advantage of
concomitant observations to determine the combined effects of competing influences.
Statistical reports provide an overall picture of patients by age, sex, diagnosis,
race, admission status and disposition. However, questions about the effectiveness
of a mental hospital program cannot be answered by examining routinely gathered
statistical data because the relevant information is lacking. To collect the necessary
information by taking a census of the total population of patients would be
prohibitively expensive. Moreover, a complete enumeration would still be only a sample
taken at one point in time and subject to error. But some of these questions might
be answered by surveying a representative sample of the population.

The advantages of a sample survey over a complete enumeration or census of 
the population are: (a) Timeliness of the data because of the rapidity with which a 
survey can be completed. (b) Increased precision because of more careful attention 
to collection of data at the source instead of from existing records designed for other 
purposes. (c) Comprehensiveness because of the simultaneous collection of data 
about variables which have important effects upon one another. (d) Representative- 
ness because collection of data need not span a prolonged time period with consequent 
confounding of seasonal fluctuations. 

When the aim is to gain insight into the causes that govern the observed fre- 
quencies so that they can be controlled in time to come, small samples taken at 
frequent intervals may give much more information by revealing trends and changes 
as they occur, than either a total count or a large sample taken at wider intervals, 
and often for the same cost. Thus, the U. S. Bureau of the Census has found it
both economical and practical to obtain reliable estimates of socioeconomic var- 
iables by collecting supplementary data from a 25% sample instead of from the 
whole population. The construction of an adequate sampling design is therefore a 
major enterprise in its own right. Such a sampling design would, of course, include 
provision for estimating the sampling error. 

The first task is to identify the critical sources of information in the hospital 
structure so as to determine the appropriate sampling units. Pilot studies are then 
necessary to obtain clues for reducing systematic error and for increasing the effi- 
ciency of sampling. Alternative sampling methods might be tried to ascertain whether 
simple random sampling, stratified sampling—with or without proportional alloca- 
tion, or cluster sampling lead to the optimum combination of precision with econ- 
omy. The report that follows presents only the frequencies of occurrence of in- 
dividual characteristics. Analysis of their interactions is still incomplete. 


DEFINITION OF THE POPULATION 

A sample was selected from the population of resident patients in the hospital 
register. The population was defined as all resident patients of record at the begin- 
ning of an arbitrarily designated observation week. New admissions during the week 
were added to the roster, but releases were not dropped. In this way a total popula- 
tion of 155 patients was identified, all of whom were registered as residing at the 
hospital during all or part of the observation week. Information pertaining to specific 
behavior and activity on the ward and to treatment was gathered retrospectively for 
the observation week. Observations of patients and their records were made at the 
end of the observation week in order to avoid the possibility that either the patients 
or hospital personnel would so alter their behavior as to bias the study. 


SELECTION OF THE SAMPLE 


The population was stratified according to sex; admission status (1st admission 
vs. readmission); and duration of current hospitalization (less than 6 months vs. 
over 6 months). The population was found to be evenly divided between the two
sexes; two-thirds of the patients were first admissions; half of the patients had been 
in the hospital less than six months. A twenty percent probability sample of 31 
cases was then composed by selecting random subsamples from each of the eight 
categories in proportion to their size. This was the largest sample that could be 
studied because of the limitations of time and staff. An effort was made to assure a 
minimum of two subjects for each category. As it turned out, one patient was erron- 
eously classified as a readmission and had to be shifted so that the category, readmit- 
ted males with over six months of hospitalization, ended up with only one individual. 
Table 1 shows the composition of the sample. 
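The selection procedure described above (proportional allocation over eight strata, with a floor of two subjects per stratum) can be sketched as follows. This is only an illustration under stated assumptions: the stratum counts are hypothetical and are not the figures of Table 1, the stratum labels are invented, and the rule for distributing leftover places (largest shortfall from the exact proportional quota first) is one reasonable choice, not necessarily the one the investigators used.

```python
import random

def proportional_allocation(stratum_sizes, sample_size, minimum=2):
    """Split sample_size across strata in proportion to stratum size,
    with at least `minimum` cases per stratum where the stratum allows it."""
    total = sum(stratum_sizes.values())
    # Exact (fractional) proportional share for each stratum.
    quotas = {k: sample_size * n / total for k, n in stratum_sizes.items()}
    # Start from the rounded-down share, raised to the floor, capped at stratum size.
    alloc = {k: min(stratum_sizes[k], max(minimum, int(q))) for k, q in quotas.items()}
    # Give remaining places to the strata furthest below their quota.
    while sum(alloc.values()) < sample_size:
        open_strata = [k for k in alloc if alloc[k] < stratum_sizes[k]]
        if not open_strata:
            break
        k = max(open_strata, key=lambda s: quotas[s] - alloc[s])
        alloc[k] += 1
    # If the floor pushed the total too high, take places back from the
    # strata furthest above their quota.
    while sum(alloc.values()) > sample_size:
        reducible = [k for k in alloc if alloc[k] > minimum]
        if not reducible:
            break
        k = min(reducible, key=lambda s: quotas[s] - alloc[s])
        alloc[k] -= 1
    return alloc

def draw_stratified_sample(members_by_stratum, alloc, seed=0):
    """Draw a simple random subsample of the allocated size within each stratum."""
    rng = random.Random(seed)
    return {k: rng.sample(members_by_stratum[k], alloc[k]) for k in alloc}

# Hypothetical stratum counts (sex / admission status / length of stay), total 155.
sizes = {
    "M/first/under6": 20, "M/first/over6": 25,
    "M/readmit/under6": 15, "M/readmit/over6": 10,
    "F/first/under6": 30, "F/first/over6": 25,
    "F/readmit/under6": 18, "F/readmit/over6": 12,
}
allocation = proportional_allocation(sizes, sample_size=31)
members = {k: [f"{k}-{i}" for i in range(n)] for k, n in sizes.items()}
sample = draw_stratified_sample(members, allocation)
```

With a twenty percent sampling fraction, the smallest strata sit close to the floor of two, which is why a single misclassified case can leave a category with only one subject.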


TABLE 1. CHARACTERISTICS OF POPULATION AND SAMPLE


                                        Population    Sample

First Admissions (Male)
   In hospital less than 6 months           17            4
   In hospital over 6 months                29
Readmissions (Male)
   In hospital less than 6 months           20
   In hospital over 6 months                              1
First Admissions (Female)
   In hospital less than 6 months
   In hospital over 6 months
Readmissions (Female)
   In hospital less than 6 months
   In hospital over 6 months

Total                                      155           31


SOURCES OF INFORMATION 
Data for the study were gathered from the case records, including nurses’ notes, 
doctors’ notes, and other clinical notations found in the records. Other written re- 
cords such as doctors’ order books, patients’ sign-out books, lists of patients’ ap- 
pointments with doctors, etc., were also abstracted. In addition, nurses were asked 
to fill out the Ward Behavior Rating Scale for each patient in the sample.


FINDINGS 

Some of the proportions that follow are based on less than the total sample be- 
cause the data were missing or the category was only applicable to part of the sample. 

Admission Status. Almost all the patients in the sample were recorded as volun- 
tary (in one case the relevant datum was missing). The majority of the patients 
(81%) had been admitted directly from the community; 13% were from receiving 
hospitals, while 6% had been transferred from other chronic treatment facilities. 

Disposition. During the observation week the discharge rate was 3%. Move- 
ment of patients from one ward to another was also at the rate of 3%. 

Time out of the Hospital. Nineteen percent of the patients in the sample were 
either employed outside the hospital or attending school.² Thirty-five percent had
spent two or more nights out of the hospital; 67% were allowed out on unaccompanied
walks; 80% had the freedom of the building; 83% had special pass privileges
either for outside work, for school, or for home visits. 

An Overnight Index was computed for all patients in the study who had been in 
the hospital at least one week by taking the ratio of number of nights slept out to 
total number of nights of current hospitalization. Seven percent had been in
hospital less than one week. For the rest the index ranged from zero to .26. Fig. 1 shows
the frequency distribution for the Overnight Index. Twenty-one percent had an 


2An additional 16% attended school on the hospital premises. 
FIGURE 1. FREQUENCY POLYGON SHOWING PERCENT OF PATIENTS IN SAMPLE WHO SLEPT
OUT OF THE HOSPITAL FOR VARIOUS PROPORTIONS OF TOTAL HOSPITAL STAY.
Overnight Index of zero, i.e., they had slept in the hospital every night; 38% had
slept out less than 1/20th of the time; 10% had slept out more than 1/20th but less
than 1/10th of the time; 27% had slept out more than 1/10th but less than 1/4th of
the time; 3% had slept outside for over 1/4th of the time.
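The Overnight Index is a simple ratio, and the grouping reported above can be reproduced with a few lines. The sketch below assumes that stays are counted in nights and that a value falling exactly on a boundary goes into the higher band; the text does not say how ties were handled. The function names and the patient figures in the usage lines are hypothetical.

```python
def overnight_index(nights_out, nights_hospitalized):
    """Ratio of nights slept out to total nights of current hospitalization.
    Returns None for stays under one week, which the study excluded."""
    if nights_hospitalized < 7:
        return None
    return nights_out / nights_hospitalized

def index_band(index):
    """Place an Overnight Index into the bands used in the text.
    Assumption: boundary values fall into the higher band."""
    if index == 0:
        return "zero"
    if index < 1 / 20:
        return "under 1/20"
    if index < 1 / 10:
        return "1/20 to 1/10"
    if index < 1 / 4:
        return "1/10 to 1/4"
    return "over 1/4"

# Hypothetical patients: (nights out, nights in current hospitalization).
examples = [(0, 30), (3, 100), (26, 100), (1, 5)]
bands = []
for nights_out, nights_total in examples:
    value = overnight_index(nights_out, nights_total)
    bands.append(None if value is None else index_band(value))
```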

Visitors. Thirty-seven percent of those for whom the data were reported had at 
least one visitor during the week; 30% had no visitors but spent at least one night 
out during the week. The remaining 33% neither spent a night at home nor had a 
visitor. 

Security. Six percent of the patients were in a maximum security ward; 16% 
were in wards that were locked all the time; 13% were in the children’s ward which 
was locked all the time; 65% were in wards that were open part or all of the time. 
During the observation week, 10% of the patients were under special observation 
because of suicidal tendencies or attempts to escape; 3% were under restraint. 

Therapy. Ninety-seven percent received psychotherapy at least once during the 
observation week. Sixty-eight percent of the sample were recorded as attending 
ward meetings. Nineteen percent were reported as attending group psychotherapy 
sessions. Sixty-six percent participated in occupational therapy; 7% received
hydrotherapy; and 52% attended gym. Half of the children attended other group
therapeutic sessions, such as dance, art, and music therapy.

Drugs. Because of the great variety of drugs and the varying dosages, a single 
day was selected in the middle of the observation week on which all drugs and dos- 
ages for each member of the sample were recorded. According to the records 29% 
of the sample received no drugs at all; 29% were on one drug; 16% were on 2 drugs; 
10% were on 3 drugs; 3% were on 4 drugs; 13% were on 5 drugs. For the 71% on 
drugs, the average number of different drugs administered during the day was 2.3. 
Altogether 30 different drugs were in use with the patients. Table 2 shows details of 
drug treatment for the observation day, the drugs being classified according to the 
pharmacological categories recommended for psychotherapeutic drugs by the AMA 
Council on Drugs.³ Twenty-three percent of the sample received sleeping pills in the
form of Doriden or chloral hydrate. Thirty-two percent of the sample were receiving 
phenothiazines: Thorazine, Compazine, or Stelazine, with Thorazine being admin- 
istered to 19% while the newer analogues were administered to 13% of the patients. 


3As reported in JAMA, 1958, 166, 1040-1041. 
TABLE 2. DRUGS ADMINISTERED TO A RANDOM SAMPLE OF PATIENTS DURING A SINGLE
OBSERVATION DAY


                                                            Proportion    Average
Drug group                     Specific Drug                of Patients   Dosage

Nonselective Depressants       Aspirin                                    620 mg.
                               Chloral hydrate                            517 mg.
                               Doriden                                    500 mg.
                               Codeine sulphate                           15 mg.
                               Dilantin                                   300 mg.

Central Muscle Relaxants       Artane                                     3 mg.
(sedatives)

Central Sympathetic            Thorazine                                  408 mg.
Suppressants                   Compazine                                  40 mg.
(phenothiazine derivatives)    Stelazine                                  11 mg.

Central Nervous System         Imprimine                                  112.5 mg.
Stimulants                     Catron                                     6 mg.
                               Dexedrine                                  7.5 mg.

Miscellaneous                  Librium                                    5 mg.
                               SKF 556                                    7.5 mg.
                               WY 2445                                    100 mg.
                               Phenacetin and Ephedrine                   Aspirin, 3 tablets
                               Miscellaneous*


*Includes neosynephrine, cortisone, colloidal gold, vitamins, colace, cogentine, feosol, gantrisin,
Lugol’s solution, etc.

Twenty-three percent of the sample were on stimulants: Imprimine, Catron or Dexe- 
drine. 

Physical Health. An examination of the medical records of the patients revealed 
that 13% had physical anomalies, 39% had at least one chronic physical illness, 10% 
had more than one chronic physical illness, and 10% had some acute physical illness.
Three percent (8% of the women) were pregnant. 

Background Variables. Background variables which other studies have shown 
to be related to the etiology or prognosis of mental illness were also tabulated. The 
ages of members of the sample show that it is a comparatively young population.
Thirteen percent are children; almost half of the sample is under 25 years of age; 
almost half are between the ages of 25 and 44; only 6% are over 45. Fifty-nine per- 
cent of the adults are unmarried. 

Table 3 shows that 83% are American-born, the remainder being either natural- 
ized citizens or Puerto-Rican Americans. Sixty-three percent have at least one 
American-born parent; 13% are of Puerto Rican parentage, and 40% have one or 
both parents foreign-born. 

Three percent of the sample were reported to be drug addicts. Six percent of 
the patients were alcoholics. Instances of mental disorder in the family were reported 
for 26%. 


TABLE 3. PLACE OF BIRTH OF PATIENTS AND PARENTS 


                                          Parents
Nationality          Patient    Father only    Mother only    Both



American-born 2! 3 2 14 
Puerto Rican : l 3 
Naturalized American 

Other : ‘ 8 
Not Reported 1 


Totals 
Socioeconomic considerations classify 21% upper class, 58% middle class, and
21% lower class. Twenty-seven percent were students. Forty percent of the women
were classified as housewives. 

Behavior leading to hospitalization. Table 4 shows the behaviors which led to the 
hospitalization of the patient. Depression was found in 42%, anxiety and fears in 


TABLE 4. BEHAVIOR LEADING TO HOSPITALIZATION*


Behavior                                      Percent of Cases

Depressed state                                     42%
Anxiety, fears                                      35%
Confusion, disorientation                           32%
Aggression towards self                             29%
Aggression towards others                           29%
Agitation and hyperactivity                         26%
Delusions and Suspicions                            29%
Withdrawal                                          26%
Affective display                                   29%
Physical complaints                                 26%
Habits (alcoholism, drug addiction, etc.)           13%
Verbal peculiarities                                13%
Hallucinations                                      13%
Bizarre behavior                                    10%
School behavior problems                            10%


*The total exceeds 100% because more than one behavior was usually
mentioned for a patient.

35%, and confusion and disorientation in 32%. Only 10% of the confusional states 
are described as organic. For children, the most frequently mentioned problems are 
those related to school behavior. This was true for three of the four children in the 
sample. Aggression towards self and others is also frequently mentioned. 

Readmissions. Seven of the eleven readmissions in the sample were two years or 
more removed from the time of their first admission. For these seven an Outcome 
Index⁴ was computed for their experience during these two years after first
admission. The Outcome Index ranged from .02 to .93. Fifty-seven percent of these
readmissions had Outcome Indices less than .125; 14% were between .125 and .50;
the remaining 29% had Outcome Indices ranging from .50 to 1.0. Evidence from
other studies suggests that these figures may be interpreted as follows: the lower
third are poor risks most of whom are likely to rotate in and out of hospital in future 
years; the upper third have a favorable prognosis for eventual adjustment in the 
community; prognosis for the middle third is guarded. 

Severity of Illness. Scores on the Ward Behavior Rating Scale ranged from 82
to 145.⁵ Mean for the total sample was 120. Mean scores for males and females were
about the same, as were those for patients with more or less than 6 months in the 
hospital. The largest difference was between first admissions with mean score of 122 
and readmissions with mean score of 116, but it was not statistically significant. 

The children in the sample were rated on a provisional form of the Children’s 
Behavior Rating Scale. In order to correct for age differences, the scores were con- 
verted to proportion of responses showing good adaptation. The range was from 
88% to 95% with a mean of 91%.


CONFIDENCE LIMITS
Because of the limitations of time and personnel and an error in the classifica- 
tion of the population which reduced one sampling category to a single subject, it 
was not possible to determine the sampling error empirically. Instead, theoretical 


4O. I. = $\frac{t_o}{t} \cdot \frac{r}{a}$, where $t_o$ = time out in days; $t$ = follow-up period since first admission in days; $r$ = no.
of releases; $a$ = no. of admissions. The O. I. ranges from zero, never released, to a hypothetical one.

5Higher scores indicate better adaptation; maximum possible score is 150.
confidence limits were computed for each of the proportions by applying the 95
percent binomial limits to the conditional distribution of p, given n, the total
number of cases in the corresponding class, and n’, the number for whom the relevant
datum was reported. Table 5 lists the upper and lower confidence limits for a selected
group of proportions which were based on the total sample of 31 patients.
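For readers who wish to reproduce limits of this kind, the sketch below computes exact two-sided binomial limits for an observed count x out of n’ reporting cases. It assumes the limits intended are the usual exact (Clopper-Pearson) ones; the text says only “95 percent binomial limits”, so this is an interpretation and not a statement of the authors' procedure. The function names are illustrative.

```python
from math import comb

def binom_tail_ge(x, n, p):
    """P(X >= x) for X distributed Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(x, n + 1))

def _solve_increasing(g, target, iterations=60):
    """Bisection on [0, 1] for g(p) = target, where g is increasing in p."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def exact_binomial_limits(x, n, confidence=0.95):
    """Exact two-sided limits for a proportion x/n (assumed Clopper-Pearson)."""
    alpha = 1 - confidence
    # Lower limit: the p at which P(X >= x) equals alpha/2.
    lower = 0.0 if x == 0 else _solve_increasing(
        lambda p: binom_tail_ge(x, n, p), alpha / 2)
    # Upper limit: the p at which P(X <= x) equals alpha/2,
    # i.e. P(X >= x + 1) equals 1 - alpha/2.
    upper = 1.0 if x == n else _solve_increasing(
        lambda p: binom_tail_ge(x + 1, n, p), 1 - alpha / 2)
    return lower, upper

# Example: 13 of 31 patients (42%) with a given characteristic.
low_13, high_13 = exact_binomial_limits(13, 31)
# Example: a characteristic observed in none of the 31 patients.
low_0, high_0 = exact_binomial_limits(0, 31)
```

With only 31 cases the interval around any proportion is wide, which is the practical point of reporting limits alongside the percentages.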


TABLE 5. CONDITIONAL 95% BINOMIAL CONFIDENCE LIMITS FOR SELECTED 
PROPORTIONS REPORTED IN SURVEY 


Factors                                    Proportion obtained    Lower limit    Upper limit


Treatment or disposition 
Discharged during week 01 
Employed outside or attending school : O9 
Slept out two or more nights 3 22 
On open ward a 418 
Under special observation 03 
Under restraint 
Received psychotherapy at least once 
Attended ward meetings 
Received group therapy 
Background characteristics 
Physical anomaly reported 
Had chronic physical illness 
Drug addiction 
Alcoholism 
Age groupings 
Children 
Under 25 years of age 
Between 25 and 45 years of age 
45 years of age and over 
Behavior leading to hospitalization 
Depression 
Anxiety and fears 
Confusion, disorientation 
Aggression against self 


SUMMARY 
A pilot study was undertaken to demonstrate the feasibility of sample survey 
methods for the analysis of the application of hospital facilities to the care of mental 
patients. The method of selecting the sample and the sources of information are 
described. The report includes substantive findings regarding the characteristics of 
the sample and the population, together with estimates of the theoretical confidence 
limits. 
A BEHAVIORAL DISTURBANCE INDEX FOR PSYCHIATRIC PATIENT
AND WARD DISTURBANCE 


JAMES L. FRAMO AND ARTHUR M. ADLERSTEIN 


Eastern Penna. Psychiatric Institute Phila. Child Guidance Clinic 
Philadelphia, Pa. Philadelphia, Pa. 


PROBLEM 


The Behavioral Disturbance Index (BDI) was developed as an attempt to 
categorize and quantify the pathological behavior of patients on a psychiatric ward, 
on both an individual and group basis. It is based on the systematic use of the routine 
nurse’s notes which are recorded in practically every psychiatric institution. The 
BDI was originally created for a project done by Rashkis and Wallace on the
relationship between patient disturbance and staff attitudes. This is a report on the
BDI, its description, range of applicability, reliability, validity, shortcomings and 
possible usefulness to those engaged in clinical research. 

Lorr’s excellent review article on “Rating Scales and Check Lists for the
Evaluation of Psychopathology” distinguishes between scales for use by psychiatric 
aides and nurses and scales for use by trained clinicians. Standard psychiatric rating 
scales based on interviews are very serviceable for quantifying clinical judgments 
but they frequently yield scores based on guarded “office behavior” or staff
conference behavior which may be quite at variance with the patient’s everyday
functioning on the ward. Scales completed by nurses and aides introduce other problems.
Ward personnel may become oversensitized to certain aspects of behavior while ig- 
noring others. Observations may be contaminated by such prestige questions as to 
who will be the evaluator of the ratings. There is no way of estimating in what way 
ward personnel affect the very behaviors being observed and evaluated. 

The BDI attempts to obviate these limitations and to yield a naturalistic 
measure of patient disturbance which more adequately measures functioning under 
living conditions of the ward. The spontaneous observations which are made most 
commonly and routinely are the daily nurse’s notes. Although psychiatric nurses and 
aides are generally required to make routine observations of patient behavior three 
times a day, these notes have rarely been used in any systematic way. It is recog- 
nized that there is considerable variability in quality and inclusiveness of nurse’s 
notes from hospital to hospital, but, assuming some minimal standards of reporting, 
they contain a valuable source of meaningful information. 


METHOD AND RATIONALE


The method used in determining the items of the BDI was inductive and empir- 
ical. The earliest stage of development consisted of reading more than 4000 nurse’s 
notes and from these were culled a wide range of symptoms, behaviors, and in- 
cidents. The categories underwent continuous modification as they were applied to 
subsequent patient evaluations. The BDI went through six revisions before the cur- 
rent list of 25 items was selected. These 25 items include practically all of the be- 
haviors likely to be reported in nurse’s notes. 

Most of the items include two sub-categories which describe disturbances at 
both ends of an item bi-polar construct. It was our belief that extremes of any kind 
should be viewed as disturbed behavior even when such behavior is not ordinarily 
considered pathological, or, is, in fact, generally socially rewarded. Such items as 
overforbearance, liking group activity too much, indifference to bodily illness, prud- 
ishness, fastidiousness and over-cooperation in ward routines are of this nature. 
With some of the categories (e.g., delusions) it was not possible to conceptualize an 
over-and under-reaction. The BDI differs from the usual psychiatric rating scale in 
that the emphasis is on psychopathological behavior; it does not allow for estimations 
of the “good” patient or of adaptive behavior. Psychiatric health, from this
viewpoint, is considered simply as freedom from disturbance of the type which comes to
the attention of ward personnel. The validity study to be described will demonstrate
that this does have real meaning. The 25 items of the BDI are given in Fig. 1.


FIG. 1. THE BEHAVIORAL DISTURBANCE INDEX


1. Aggressive disturbance (motoric)
   a. Against others
   b. Against self
   c. Against material objects
2. Aggressive disturbances (verbal)
   a. Against others
   b. Against self
   c. Against material objects
3. Bizarre behavior
4. Complaining behavior
   a. Whining, complaining
   b. Overforbearing; self-denial
5. Compulsive disturbances
6. Delusional disturbances
7. Eating disturbances
   a. Refusal to eat
   b. Overeating
8. Feeling disturbances
   a. Panic, anxiety attack, extreme fear
   b. Flat, apathetic, dull
   c. Inappropriate affect
   d. “Nervous,” anxious, upset
9. Hallucinatory disturbances
10. Manners disturbances
   a. Unconcerned with social amenities
   b. Overpolite, obsequious, ceremonious
11. Mood disturbances
   a. Elated, exhilarated
   b. Depressed
12. Motor disturbances
   a. Overactive, pacing, running, agitated
   b. Immobile, catatonic, posturing
13. Paranoid disturbances
14. Participation disturbances
   a. Refuses to go to OT or group activities
   b. Likes group activity too much; too active
15. Privilege disturbances
   a. Irresponsible privilege behavior
   b. Refuses privileges
16. Physical Complaint disturbances
   a. Complains of headaches, stomach aches, etc.
   b. Indifferent to bodily injury, illness
17. Responsivity disturbances
   a. Overdependent, clinging, helpless
   b. Negativistic, resistant, stubborn, sullen
18. Sex disturbances (both homo- and heterosexual)
   a. Masturbation, exhibitionism, seductiveness
   b. Prudish, disapproving of sexual activities
19. Sleep disturbances
   a. Oversleeping
   b. Insomnia; has trouble sleeping
20. Sociability disturbances
   a. Seclusive, withdrawn
   b. Forces self on people; a “pest”
21. Thinking disturbances
   a. Confused, disoriented
   b. Perplexed, preoccupied
22. Tidiness disturbances
   a. Sloppy in dress; untidy in toilet habits
   b. Overly fastidious in dress or toilet
23. Verbal disturbances
   a. Tangential, irrelevant speech
   b. Rarely talks, mute
   c. Loud, boisterous, shouting, overtalkative
24. Visitor disturbances
   a. Overanxious about visitors
   b. Negative about visitors
25. Ward routine disturbances
   a. Does not cooperate in a ward routine
   b. Overcooperates, seeks work unnecessarily
Miscellaneous (write in)


SCORING



A manual has been developed for the BDI which includes general principles of
scoring, and descriptions and examples of each of the items.¹ The general principles
of scoring from the manual are as follows:


1. Inasmuch as the focus of interest is on disturbance, appropriate, healthy, trivial or
neutral occurrences and behaviors are not checked on the BDI. In general, if a behavior or
symptom is striking enough to be remembered and noted by the nurse or aide in her clinical notes, that
disturbance should be checked if it belongs under one of the BDI categories. (The following are
nurse’s notes which would not be checked, “Patient seems to be happy today.” “Patient outgoing
and conversed with staff today.” “Patient received a package today.” “Patient had a visitor
today.”)


2. In complex behaviors, the separate items are checked, e.g., if nurse’s note says “Patient
complained of stomach-ache, refused to get out of bed and ate little at breakfast”, all of these
items are checked.


1Considerations of space prohibit reproduction of the entire manual. It may be obtained from the
senior author.

3. If the same behavior occurs in successive shifts, it is checked provided the nurse or aide
is not reporting hearsay information (i.e., a nurse reports that she heard from the nurse on the
last shift that the patient had a fight).


4. Only disturbances which are reported are checked; no inferences should be made. If a
patient stayed in bed all day, just that item is checked on each shift and not other disturbances
such as participation disturbances, etc., which are a concomitant of this behavior.


5. If the same behavior occurs in the same shift as part of an unbroken pattern it is
considered a single disturbance, whereas if its continuity is broken, i.e., an event or incident
intervenes, it is checked as frequently as this occurs. (E.g., patient complaining of a headache
continuously receives one check in this category, but if patient complains of a headache, gets into a
fight, and complains of a headache again he gets two checks for Physical Complaint even though
these occur during the same shift.)


6. Nurses should be discouraged from writing such things as “No change in behavior,”
“Patient’s usual behavior.” When this does occur, as it sometimes does, the observer should
check back to the previous notation which stated what that usual behavior was and use his
judgment as to what categories to check.

The actual scoring of the Index is simple and straightforward: the recorder reads through the
nurse’s notes for a particular period, say, one day, and makes a check opposite the appropriate
item on the list. Checks are entered for each of the shifts on which the behavior or incident is
noted. Behavior for which there is no category on the BDI may be described under Miscellaneous;
it is our experience that this very rarely has to be used. In the initial application of the BDI
we were interested in the amount of disturbance manifested by patients over a one week period.
A form was developed which provided for entries opposite each disturbance. This allowed for
daily entries and a weekly total. A space was provided at the bottom of the form for the total
number of disturbances observed for the week. This procedure can, of course, be modified for the
purposes of a particular research problem.
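The counting rules for repeated behavior (one check per unbroken run within a shift, a fresh check after an intervening incident, and separate checks on successive shifts) can be sketched in code. This is an illustration only: it assumes that the recorder has already translated each nurse's note into an ordered list of BDI category labels for the shift, with None standing for an intervening event that is not itself scored. The function names and the sample data are hypothetical.

```python
from collections import Counter
from itertools import groupby

def score_shift(observations):
    """Count checks for one shift.

    `observations` is the ordered list of category labels noted during the
    shift. Consecutive repeats of a label form one unbroken pattern and earn
    a single check; a label that recurs after something else intervenes is
    checked again. None marks an unscored intervening event.
    """
    runs = (label for label, _ in groupby(observations))
    return Counter(label for label in runs if label is not None)

def score_period(shifts):
    """Sum the checks over a sequence of shifts (e.g. three per day)."""
    total = Counter()
    for observations in shifts:
        total += score_shift(observations)
    return total

# Continuous headache complaints during one shift: a single check.
continuous = score_shift(["Physical Complaint", "Physical Complaint"])

# Headache, then a fight, then headache again in the same shift: two checks.
interrupted = score_shift(
    ["Physical Complaint", "Aggressive (motoric)", "Physical Complaint"])

# The same behavior reported on two successive shifts is checked on each.
day_total = score_period([["Mood"], ["Mood"], []])
```

A weekly total of the kind entered at the bottom of the form is then the sum of the counts over all categories for the week's shifts.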


RELIABILITY


Inter-rater reliability was obtained by having two raters independently score 
the disturbances of 20 patients for a one week period. Reliability was derived by the 
use of two correlation techniques. The first method was to correlate the two sets of
“total” scores; this was done to determine how much agreement there was between
raters as to the gross amount of disturbance. The correlation was .97, significant 
well beyond the .01 level. This suggests that two raters trained in using the BDI will 
pick up almost all of the disturbances noted by ward personnel. 

This correlation, however, says nothing about agreement on particular items; it 
is theoretically possible to have agreement on total disturbance when in fact the 
raters have used entirely different categories. We therefore determined inter-rater 
reliability for the separate items for the week as well as the agreement on separate 
items for a particular day. The data presented a unique problem since not checking 
an item also constituted an agreement. In order to correct for this and to rule out a 
spuriously high correlation based on chance agreement, a formula was derived which 
took these factors into account.² The mean item inter-rater reliability for the week
was .69 and the mean agreement for a particular day was .62 (an r of .428 is reliable 
beyond the .01 level). We can therefore conclude that inter-rater reliability is well 
within acceptable limits. 


²We wish to thank Drs. B. Karon and D. Estavan for their help in deriving this formula. Given:
C_1  The number of blocks rater one checks
C_2  The number of blocks rater two checks
N    The number of blocks the test is composed of
P    The number of agreements (N − the number of disagreements)
P'   The number of chance agreements

\[ P' = \frac{C_1 C_2}{N} + \frac{(N - C_1)(N - C_2)}{N} \]

\[ \text{Inter-rater reliability} = \frac{P - P'}{N - P'} \]
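
The footnote's formula can be sketched as a small function. The sketch assumes the chance-corrected form (P − P')/(N − P'), with P' the sum of chance agreements on checked blocks and on unchecked blocks; the function names and the worked numbers are hypothetical.

```python
def chance_agreements(c1, c2, n):
    """Expected agreements by chance: both raters check a block, or both leave it blank."""
    return (c1 * c2) / n + ((n - c1) * (n - c2)) / n

def interrater_reliability(c1, c2, n, p):
    """Agreement beyond chance, as a fraction of the maximum possible beyond chance."""
    p_chance = chance_agreements(c1, c2, n)
    if n == p_chance:
        # Happens when neither rater (or every rater) checks every block alike by construction.
        raise ValueError("reliability undefined: chance agreement already equals N")
    return (p - p_chance) / (n - p_chance)

# Hypothetical week: 100 blocks, rater one checks 10, rater two checks 12,
# and they disagree on 6 blocks, so P = 94.
example = interrater_reliability(c1=10, c2=12, n=100, p=94)
```

Because most blocks are left unchecked by both raters, raw agreement in this example is 94 per cent, while the corrected coefficient is much lower; this is the inflation the correction is meant to remove.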

VALIDITY


The validity of the BDI was assessed by comparing the scores of a group of discharged
patients with the scores of a group of still-hospitalized psychotic patients.
We hypothesized that while both groups would be equally disturbed at the beginning
of hospitalization (i.e., relatively equal on scores), the group of discharged patients
should manifest less disturbance just prior to discharge than a comparable group of 
hospitalized patients. 

Twenty female patients from the Eastern Pennsylvania Psychiatric Institute 
constituted the validation study group. Of the 20 patients, 10 had been discharged 
from the Institute at the time of the study and 10 were still in the hospital. Criteria 
for both groups included hospitalization for a period of at least one month, no history 
of shock therapy, and the absence of a diagnosis of central nervous system damage. 
In addition, the patients in the discharged group had to have been discharged by the 
staff (not voluntarily signed out) and had to have remained out of the hospital for at 
least six months after discharge. The hospitalized group had to have been in-patients 
for a period of at least three months after the last recording of the BDI scores. From 
a large pool of discharged patients, all of whom met the criteria, we randomly selected 
the study group of ten. Ten other patients matched on the relevant variables were 
then selected from the still-hospitalized group. Entries on the BDI were made for 
each patient from nurse’s notes on the third and fourth day of hospitalization in order
to allow several days for the patients to become acclimated to the hospital. Entries 
were also made for the discharged group from notes made three and four days prior 
to discharge. For the hospitalized group, entries were made using notes covering a 
comparable length of time. The mean length of the observation was 55 days for the 
discharged patients and 56 days for the hospitalized group. The two validation 
groups of improved and unimproved patients, then, were balanced on the para- 
meters of sex, age, diagnosis, and length of hospitalization. 

In order to test the hypothesis it was important to know if the two groups 
differed initially on BDI scores. The mean score for the still-hospitalized group was 
26 and for the discharged group the mean score was 19; this initial difference in mean 
scores was not statistically significant (t = 1.45, p > .05). In other words, patients in
both groups were about equally disturbed upon entering the hospital. We then tested 
to see if there was a difference in scores for the second or final set of entries. The mean 
of the discharged group had dropped from 19 to 4.5 while the mean of the unimproved
patients had dropped from 26 to 22 (t = 3.70, p < .01). This clearly indicates that
the BDI reflects judgments of the staff about readiness for discharge. 
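
The between-group comparisons above can be sketched as follows. The sketch assumes a pooled-variance t for two independent groups of ten, judged two-tailed against 18 degrees of freedom; the text does not state which form of t was used, so both choices are assumptions. The critical values are the standard tabled ones.

```python
import math
from statistics import mean, variance

def pooled_t(group_a, group_b):
    """Student's t for two independent groups, using the pooled variance."""
    na, nb = len(group_a), len(group_b)
    pooled_var = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / (na + nb - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))

# Two-tailed critical values of t for 18 degrees of freedom (10 + 10 - 2).
T_CRITICAL_DF18 = {".05": 2.101, ".01": 2.878}

def verdict(t_value, critical=T_CRITICAL_DF18):
    """Classify a t value against the tabled critical values."""
    t_abs = abs(t_value)
    if t_abs >= critical[".01"]:
        return "p < .01"
    if t_abs >= critical[".05"]:
        return "p < .05"
    return "p > .05"

initial_comparison = verdict(1.45)   # t reported for the initial scores
final_comparison = verdict(3.70)     # t reported for the final scores
```

Under these assumptions the reported values fall where the text places them: the initial difference does not reach the .05 level and the final difference exceeds the .01 level.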

As an additional test we calculated the differences between each group’s initial 
and final scores to see if the drop in mean scores represented a statistically significant 
drop. This was done by testing the difference scores in each group against a null 
hypothesis of zero drop. The difference t in the still-hospitalized group was 1.05,
which is not sufficient to warrant rejection of the null hypothesis. The difference t in
the discharged group was 3.70 (p < .01). This indicates that the difference between
initial and final scores for the unimproved group can be accounted for on the basis of 
a chance drop, indicating that there is no reason to believe that there was improve- 
ment in the still-hospitalized group. The difference for the discharged group prob- 
ably represents a true drop in score or disturbance. This validation study, then, 
clearly substantiates the hypothesis that the Behavioral Disturbance Index is a valid 
measure of change in the ward behavior of patients. 
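
The test of difference scores against a null hypothesis of zero drop can be sketched as a one-sample t on initial-minus-final differences. The patient scores below are hypothetical; the critical values are the standard two-tailed ones for 9 degrees of freedom, which assumes ten patients per group as in the study.

```python
import math
from statistics import mean, stdev

def difference_t(initial_scores, final_scores):
    """t for the mean drop (initial minus final) against a null hypothesis of zero drop."""
    drops = [a - b for a, b in zip(initial_scores, final_scores)]
    standard_error = stdev(drops) / math.sqrt(len(drops))
    return mean(drops) / standard_error

# Two-tailed critical values of t for 9 degrees of freedom (10 patients - 1).
T_CRITICAL_DF9 = {".05": 2.262, ".01": 3.250}

def rejects_zero_drop(t_value, level):
    """True when the t value exceeds the tabled critical value at the given level."""
    return abs(t_value) >= T_CRITICAL_DF9[level]

# Hypothetical scores for five patients, to show the calculation only.
example_t = difference_t([20, 15, 25, 18, 22], [5, 6, 4, 3, 7])

discharged_significant = rejects_zero_drop(3.70, ".01")     # reported t, discharged group
hospitalized_significant = rejects_zero_drop(1.05, ".05")   # reported t, hospitalized group
```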


DISCUSSION 


The Behavioral Disturbance Index has been demonstrated to be a reliable in- 
strument which can differentiate between recovered and non-recovered psychotic 
patients. The validity study demonstrates that the readiness for discharge as decided
by patients’ psychiatrists is reflected in the lowered disturbance measured by
the BDI.

In its present form the BDI points up only gross behavioral disturbance. It
does not attempt to characterize patient disturbance psychodynamically and it is 
not possible to state with certainty that a higher score means greater psychopath- 
ology. This is not necessarily a disadvantage since it is well known that patients with 
severe inner pathology frequently manage to survive socially; the BDI appears to 
reflect the kind of overt disturbed behavior which comes to the attention of others 
socially. Society usually does not care how emotionally sick a person is unless they 
show it. 

It is recognized that the total disturbance on the BDI is a summation of dis- 
parate behaviors which are certainly not equivalent. The problem of weighting 
presents many difficulties. It might be possible to weight the different items in 
accordance with some theory of psychopathology, or empirically. The approach of 
Phillips and Rabinovitch ®? who statistically interrelated patterns of symptoms is an 
example of empirical treatment of this kind of data. Their categories of “Avoidance
of Others”, “Self-indulgence and Turning Against Others”, and “Self-deprivation
and Turning Against the Self” appear promising as methods of ordering symptoms
and behaviors into meaningful clusters.

Nurse’s notes have the obvious advantage of availability; the research investi- 
gator can rely on existing data and go back to any point in time. For example, he 
can select a group that was given a particular form of treatment and compare pre- 
treatment behavior with post-treatment behavior. He can also trace the movement 
of patients whose initial behaviors were similar and who experienced different courses 
of treatment. It might be possible to delineate from symptom clusters which patients 
are likely to respond to treatment. 

The BDI was originally devised to evaluate patient disturbance on an experi- 
mental ward. Just as an individual patient’s disturbance can be totalled, so can the 
disturbances of an entire ward be measured so as to reflect experimental or thera- 
peutic changes introduced on a ward, or the effect of ward personnel shifts. Those 
workers utilizing therapeutic community techniques can get a quantified estimate of
ward excitement or “tone”. It should be remembered, however, that the BDI is
intended to enrich rather than replace other observations.


SUMMARY 


A Behavioral Disturbance Index, based on categorization of behaviors and 
symptoms obtained from nurse’s notes, has been found to be a useful instrument for 
reflecting the degree of disturbance of an individual psychotic patient or whole wards 
of patients. This empirically derived method of evaluating patient or ward disturb- 
ance is relatively simple to use, proved to be reliable, and was valid as a reflection of 
the progress or chronicity of the hospitalized psychotic patient in the living situation 
of the ward. The BDI yields quantitative scores which can be related to any other 
relevant research data, such as the effect of treatment on an individual patient, or 
the effect on the total ward of introducing experimental or therapeutic regimes. 

INTRODUCTION


In a preliminary study we found that both group psychotherapy and consul- 
tation in behalf of male neuropsychiatric hospital patients were related to transfer 
from closed to open wards and to departure from the hospital via trial visit and 
medical discharge. Fisher’s Exact Test p was .06 for 14 in the intervention groups 
(7 who received 25 hours of group psychotherapy and 7 in behalf of whom there were 
21 hours of consultation) as opposed to seven for whom there was no psychological 
intervention. The data were secured 142 days after onset of the experiment and 74 
days after its termination. There was no psychological intervention in behalf of any 
of the patients during the follow-up period. The data indicated, however, that our 
matching of patients in terms of prognosis for recovery was not readily defensible and 
might have prejudiced our findings in favor of the treatment groups. The present 
study represents a replication of our pilot study with more strict controls. 

The experimental design for the present study was a joint production of many 
individuals: psychology staff, trainees and consultants in psychology from the University
of Tennessee. The three assistants¹ conducted the consulting and psychotherapy
as a part of their training at the Veterans Administration Hospital, Salisbury,
North Carolina. At the time of the experiment, they were advanced trainees
about to return to the University of Tennessee for their last year of graduate study.
All had some training and experience in group psychotherapy, but this was their
introductory training as consultants in an NP hospital setting. The writer met with
them at least weekly throughout the experimental period to formulate plans and dis- 
cuss problems. 

The procedure aimed at modification of those behaviors considered responsible 
for closed-ward status and NP hospitalization. We hypothesized that: (1) Group 
psychotherapy with and consultation in behalf of NP hospital patients would lead 
to open-ward status and non-hospitalization, and (2) alternatively, consultation in 
behalf of patients would be followed by more such changes than would follow group 
psychotherapy. In formulating the second hypothesis it was reasoned that consultation
relative to the patient with many of the hospital personnel would extend the influence
of the psychologist and be effective by reason of something of a “total push.”
That is, the therapist spent only a few hours per week with the patients in group 
psychotherapy while nurses, nursing assistants, social workers and technicians spent 
many hours per day with them. It was thought that advising nurses and other 
personnel relative to altering the behaviors of patients would result in changes that 
would be reflected in open- rather than closed-ward status and in leaving and staying 
out of the hospital. 


METHOD 


Initially, 44 closed-ward, non-geriatric female NP hospital patients were 
randomized into three groups of 16, 15 and 13. At random one group was chosen for 
consultation (in behalf of), one to receive group psychotherapy, and the third as a 
non-intervention or control group. Other patients admitted to the same closed 
wards during the first 10 weeks of the 13-week experimental period were randomly 
assigned to one of the three groups and those who left the hospital during the first
two weeks of the study were excluded. The experimental period extended from May
5, 1959 to July 31, 1959, inclusive (88 days); the follow-up period from August 1,
1959 to December 31, 1959, inclusive (153 days).

¹Assisted by William F. Gwynn, Ft. Hays State College; Zachery Mayberger and Garret H. Yanage,
University of Tennessee. I am deeply indebted also to Dr. William E. Morris for assistance in
devising this study and in preparation of this report.

All of the patient-subjects were members of one of the three groups throughout 
at least two consecutive weeks of the experimental period and throughout all of the 
follow-up period. The mean number of patient days for the three groups during the 
experimental period did not differ significantly. They were: I. Consultation group,
74.6 (SD, 24.3); II. Therapy group, 68.2 (SD, 22.1); III. Non-intervention control
group, 77.0 (SD, 17.6). The numbers of patients who spent at least two weeks of the
experimental period in one of the groups varied because of chance factors and attri- 
tion through leaving the hospital. 

The same physician was in charge of all the patients throughout the study and 
follow-up periods and the make-up of the ancillary personnel was relatively constant. 
Personnel other than psychology staff had no knowledge of the nature of the study 
nor even that a study was in progress. This secrecy was maintained while the pa- 
tients were being selected and during the experimental and follow-up periods. 

The psychotherapy group (II) met twice weekly for 13 consecutive weeks in
sessions of about 90 minutes each from May 5, 1959 to July 31, 1959, inclusive. During
this period the therapists spent about the same amount of time with nurses, nursing
assistants, social workers and others (physical medicine and recreation technicians,
etc.) in behalf of the consultation group (I). They gave no therapy to the consultation
group during the experimental and follow-up periods and had no contact
with the responsible physician relative to any of the patients in the study during that 
period. There was no psychological intervention in behalf of any of the patients in 
the study during the follow-up period. 
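
The period lengths quoted for the study, and the approximate therapy time implied by the schedule, can be checked with simple date arithmetic. The sketch below counts both end dates, as the text's "inclusive" indicates, and treats each session as exactly 90 minutes, which is an approximation of "about 90 minutes".

```python
from datetime import date

def inclusive_days(start, end):
    """Number of calendar days from start to end, counting both end points."""
    return (end - start).days + 1

experiment_days = inclusive_days(date(1959, 5, 5), date(1959, 7, 31))
follow_up_days = inclusive_days(date(1959, 8, 1), date(1959, 12, 31))

# Twice-weekly sessions over 13 weeks, each taken as 90 minutes.
sessions = 13 * 2
therapy_hours = sessions * 90 / 60
```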

The therapy of Group II proceeded along more-or-less conventional lines but 
with more-than-usual attention to specific behaviors. For example, one patient in- 
cessantly and in a loud voice berated the personnel because she was “locked up.” 
Therapist and group efforts were directed towards getting her to use something of a 
“sales” approach in her efforts to get attention and to get transferred to an open
ward. 

Consultation in behalf of Group I was complicated by the fact that it was not 
sought by the consultees (nurses, et al.) and that consultees had to be kept unaware of
the criteria, open-ward status and de-hospitalization. We would have preferred a
more normal consultant-consultee relationship but found no way of attaining it 
without violation of the secrecy we considered essential to experimental control. We 
deduce that secrecy was maintained from the post-study expressions of surprise of 
some of the consultees when informed that the study had been made. 

As the study progressed it appeared that the therapists were having difficulty 
with the consulting role. They couldn’t get in quite as much time consulting as they 
devoted to psychotherapy. They didn’t always know how to structure the consulta- 
tion—how to draw from nurses and others descriptions of critical behaviors, etc. 
They seemed to have some difficulty (mostly in the early stages) in centering con- 
sultation on behavior change and not on change in hospital status. We intended that 
treatment should center on behavior rather than on the criterion of changed status. 
We would have preferred that the therapists had no knowledge of the criteria, but 
we saw no way of accomplishing this. However, their knowledge of results was limit- 
ed since the data were not analyzed until after completion of the study. 

There was no known psychological intervention in behalf of Group III during 
the study. They were not taken into psychological therapy. No persons were ap- 
proached or consulted in their behalf by psychology personnel. When the therapist 
was approached by hospital personnel concerning a member of the control group, he 
resorted to evasion and subterfuge to avoid influencing the questioner. Discussion 
of members of the control group by the therapist was scrupulously avoided in all 
therapy and consultation contacts and was circumvented if possible when intro- 
duced by others. 

RESULTS


TABLE 1. LOCATION OF MEMBERS OF THREE GROUPS*


Groups Closed-Ward Open-Ward Out of Hospital 


Start of Experiment: 
I. Consultation 23 
II. Group therapy 32 
III. Control 20 


End of 88-Day Experiment: 
I. Consultation 
II. Group therapy 
III. Control 


End of 153-Day Follow-Up: 
I. Consultation 
II. Group therapy 
III. Control 


*Corrected Chi Square p (one tail) favors the treatment groups combined 
versus the control group in terms of being on open wards or out of the hospital 
as follows: .05 at the end of the experimental period and .004 at end of the 
follow-up period. Seemingly, some of the effects of psychological treatment be- 
come apparent a considerable period of time after cessation of the treatment. 


The data support hypothesis 1 that psychological treatment via consultation (in 
behalf of) and group psychotherapy would lead to open-ward status and departure 
from the hospital. A trend in this direction at the .05 level of confidence was evident 
at the end of the 88-day experimental period (Table 1) and beyond the .01 level of 
confidence at the end of the subsequent 153-day follow-up period (Tables 1, 2 and 3). 


TABLE 2. MEAN PER CENT OF TIME GROUPS SPENT IN VARIOUS CATEGORIES WITH STANDARD
DEVIATIONS OF MEANS IN PARENTHESES

                                  A              B               C
                        N    Closed-Ward     Open-Ward     Out of Hospital       B + C

I.   Consultation       23    50.4 (7.8)     20.6 (5.9)      29.0 (6.8)       49.6 (7.8)
II.  Group therapy      32    33.4 (4.3)     32.6 (5.9)      34.0 (6.2)       66.5 (5.7)
III. Control            20    69.1 (7.8)     13.6 (3.8)      17.3 (6.0)       31.0 (6
I + II                  55    40.6 (4.2)     27.6 (4.2)      31.9 (4.7)       59.4 (4.8)


The data do not support hypothesis 2 that consultation (in behalf of) would be 
more effective than group psychotherapy in moving closed-ward NP hospital pa- 
tients to open wards and out of the hospital. In fact, there is a trend (not significant) 
in the opposite direction (Tables 1, 2 and 3). 


TABLE 3. CRITICAL RATIOS OF MEAN DIFFERENCES FROM TABLE 2.

                          A               B               C
                     Closed-Ward      Open-Ward     Out of Hospital

I - II                   1.92*          -1.45            -.33
I - III                 -1.69            1.01            1.29
II - III                -4.04**          2.69**          1.94*
(I + II) - III          -3.23**                          1.92*

*Considered significant or nearly so.
**Considered very significant.
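
The relation between Tables 2 and 3 can be sketched as follows. The sketch assumes that each critical ratio is the difference between two group means divided by the square root of the sum of their squared standard errors, and that the values in parentheses in Table 2 are those standard errors; the text does not give the formula, so this is an assumption. Under it, the I - III row comes out at about -1.70, 1.00 and 1.29, against the printed -1.69, 1.01 and 1.29.

```python
import math

# (mean per cent of time, standard error of the mean), read from Table 2.
table2 = {
    "I":   {"closed": (50.4, 7.8), "open": (20.6, 5.9), "out": (29.0, 6.8)},
    "II":  {"closed": (33.4, 4.3), "open": (32.6, 5.9), "out": (34.0, 6.2)},
    "III": {"closed": (69.1, 7.8), "open": (13.6, 3.8), "out": (17.3, 6.0)},
}

def critical_ratio(first, second):
    """Difference between two means in units of the standard error of the difference."""
    (mean_a, se_a), (mean_b, se_b) = first, second
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)

def compare(group_a, group_b):
    """Critical ratios for all three categories, group_a minus group_b."""
    return {
        category: critical_ratio(table2[group_a][category], table2[group_b][category])
        for category in ("closed", "open", "out")
    }

consultation_vs_control = compare("I", "III")
therapy_vs_control = compare("II", "III")
```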

CONCLUSIONS

The data show that both consultation and group psychotherapy were related to 
change to open-ward and non-hospital status, but that group therapy was the more
decisive factor. We think these changes resulted from changed behaviors and that the
changes in behavior stemmed primarily from psychological treatment. However, our 
data do not properly define those behaviors. Unfortunately, we did not secure sys- 
tematic data which would relate specific behaviors to being in the hospital and to 
being on closed as opposed to open wards. We saw too late that we had opportunity 
for this. An occasional note by the therapist says, for example, “Patient has not
written his folks that the F. B. I. is after him in the past three weeks.” We hope that
others will investigate this area. Actually, we do not know which behaviors, if any, 
were responsible for NP hospitalization and/or ward status of any of our subjects. 
Perhaps a hierarchy of behaviors is responsible. 

EMPIRICAL SYNDROME GROUPINGS OF SCHIZOPHRENIC
HOSPITAL ADMISSIONS¹
WILSON H. GUERTIN 
University of Florida 
PROBLEM 

The search for test correlates with schizophrenia is like the search for the Holy 
Grail; the object of the search may actually exist, but the only certain reality is in 
the minds of men. Conflicting proposals of relations between schizophrenia and 
behavior or test variables, and the failure to substantiate them upon cross-validation, 
usually are attributed to sample differences“: °-*”*), Since schizophrenia is gener- 
ally acknowledged to be a collection of variously described syndromes ‘*: ». 277), there 
is little reason to expect uniform sampling of subjects for research, and even when it
does occur, the obtained results would have little application for other different 
samples of schizophrenia. What clearly is needed to provide a break-through in re- 
search on schizophrenics is some form of quantitatively specified typology. 

Armchair speculation about types of schizophrenia has persisted since Kraep- 
elin’s first proposals, with little general agreement among psychiatrists, and produced 
almost no worthwhile research findings. As we move from the armchair to a more 
quantitative study of similarities among individuals we first encounter linkage 
analysis‘’’. This approach, like the armchair speculation, fails to provide a unique 
solution to the problem of what are the particular types, i.e., many alternative solutions
are possible. Just as the incorporation of the simple structure concept into
factor analysis provides a unique and finite solution in analyzing clusters of traits, 
unique solutions as to types of people in the sample can be derived similarly from 
clusters of people (linkage-types). Transposed factor analysis does this by finding 
the simple structure solution for a matrix of intercorrelations between people, ex- 
pressing degrees of similarity with respect to such variables as psychiatric symptoms. 

In their efforts to identify major psychiatric syndromes, factor analysts have
seldom focused on the schizophrenias exclusively. As in optics, the smaller the field
area, the more apparent the detail becomes. Only two studies®: *) have employed
behavior symptoms of schizophrenics in a transposed factor analytic design to get a
glimpse at the underlying types of schizophrenics. Both these studies failed to
describe the obtained type-factors statistically in terms of symptoms. This is a consequence
of the transposed factor analytic methodology that gives results in terms
of factor loadings for individuals rather than symptoms. Types derived in this
fashion are, therefore, sometimes described verbally by referring to the symptoms
held in common by the few heaviest loaded individuals on each factor. Or, more
quantitatively, the relation of a symptom to a type-factor can be represented by a
correlation or chi-square to express the presence of the symptom associated with
individuals lying in the upper range of factor scores.

¹The author is greatly indebted to Drs. Frank du Mas, Richard Jenkins, Leonard Goodstein, and
Sanford Golin for their many helpful suggestions. From the V. A. Hospital, Knoxville, Iowa.

Neither of these methods is consistently satisfactory for demonstrating the 
relationship between presence or absence of a symptom and the individual’s type- 
factor score. Pearson correlations will be low when only the most typical or least 
typical individuals show marked presence or absence of the symptom and there is no 
covariation throughout the remainder of factor-score range. Should the moderate 
factor loadings be associated with the presence or absence of a symptom, a chi- 
square based upon a single high cutting line will completely miss the relationship. 
Were the cutting line moved around to fit the obtained distribution for each symptom,
a worthwhile description of the type-factor in terms of symptom occurrences
associated with different ranges or levels of type-factor loading would result. However,
there is no longer any need to proceed in such a haphazard manner since du
Mas"? introduced the systematized procedure for identifying relationships between 
symptom categories and such scale values as type-factor loading called Manifest 
Structure Analysis (MSA). 

Thus, the present study seeks to illustrate the bringing together of factor 
analytic typing and MSA description of the types. Type-factors are to be identified 
from a sample of the symptoms of chronic male schizophrenics, and MSA catescales 
of symptoms as they relate to each of the obtained type-factors are to be established 
to make it possible to calculate type-factor loadings directly from symptom data 
through the use of tables. Or, more grossly, the methodology to be illustrated is 
directed toward providing quantified reference information that can serve as diag- 
nostic criteria to provide homogeneity and reproducibility of Ss for studies relating 
variables to “schizophrenia”.


PROCEDURE 

The 49 patients of this study were male veterans with hospital diagnoses of 
schizophrenia. The upper age limit was 40. Almost none of the cases were first admissions.
Most were readmissions to the VA Hospital, Knoxville, Iowa; others had
been previously hospitalized in other VA or state hospitals and were direct transfers 
or had failed in their efforts to readjust to the community. The clearly chronic 
nature of the sample would make it impossible to generalize data from this sample 
to an acute sample. Lobotomized patients and those suffering from post-EST con- 
fusion were excluded. Many were on ataractic drugs at the time of evaluation. The 
fiftieth subject was an hypothetical normal based upon the writer’s estimates of item 
scores for a normal individual in a hospital situation. 

The aim of the study, to identify types of schizophrenics, made it desirable to 
evaluate only those symptoms which were relevant to this differentiation. None of 
the most applicable rating scales“: ® * were designed to study schizophrenics ex- 
clusively, so a combination of items from all three was used to provide as many 
relevant items as possible. Nineteen of the originally selected items proved to have 
undesirable semantic features or their frequency of occurrence was higher than 90 
or less than 10 per cent.²

²Copies of the orthogonal and oblique factor matrices, as well as the rating scale and tables for
converting mean score values of individuals to factor loadings, have been deposited with the American 
Documentation Institute. Order Document No. 6559, remitting $1.25 for microfilm or $1.25 for 
photocopies. 

All ratings were made by the author on a five-point scale during the first week
after admission. Most of the information was obtained in an interview that lasted 
from 20 minutes to an hour. Additional information about ward behavior was ob- 
tained by interviewing a nursing assistant (attendant). Cutting lines were estab- 
lished and data transferred to keysort cards for calculating tetrachoric correlations 
between each of the subjects. The 50 × 50 intercorrelation matrix was factored by
the multiple-group centroid method. The obtained factors were rotated to ortho- 
gonality and then to an oblique simple structure. 
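
The correlation of persons with persons that underlies the transposed analysis can be illustrated in miniature. The sketch below does not reproduce the author's keysort-card computation; it swaps in the cosine-pi approximation to the tetrachoric coefficient, r = cos(π / (1 + √(ad/bc))), where a, b, c and d are the cells of the fourfold table for two subjects. The three symptom profiles are hypothetical.

```python
import math

def fourfold(x, y):
    """Cells of the 2 x 2 table for two subjects' dichotomized symptom profiles."""
    a = sum(1 for p, q in zip(x, y) if p and q)          # present in both
    b = sum(1 for p, q in zip(x, y) if p and not q)      # present in first only
    c = sum(1 for p, q in zip(x, y) if not p and q)      # present in second only
    d = sum(1 for p, q in zip(x, y) if not p and not q)  # absent in both
    return a, b, c, d

def tetrachoric_cos_pi(x, y):
    """Cosine-pi approximation to the tetrachoric correlation (an approximation only)."""
    a, b, c, d = fourfold(x, y)
    if b * c == 0:
        if a * d == 0:
            raise ValueError("undefined: both diagonals of the fourfold table are empty")
        return 1.0  # limiting value when there are no disagreements
    return math.cos(math.pi / (1 + math.sqrt((a * d) / (b * c))))

def person_matrix(profiles):
    """Correlations between every pair of subjects, keyed by (subject, subject)."""
    names = list(profiles)
    return {
        (i, j): 1.0 if i == j else tetrachoric_cos_pi(profiles[i], profiles[j])
        for i in names
        for j in names
    }

# Hypothetical profiles: 1 = symptom present, 0 = absent, over eight symptoms.
profiles = {
    "S1": [1, 1, 1, 0, 0, 0, 1, 0],
    "S2": [1, 1, 0, 0, 0, 1, 1, 0],
    "S3": [0, 0, 1, 1, 1, 0, 0, 1],
}
matrix = person_matrix(profiles)
```

In the study itself the matrix would be 50 × 50, one row and column per subject, and it is this matrix, not a matrix of symptom intercorrelations, that is factored.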

The principles of manifest structure analysis“ were employed to demonstrate 
the empirical relationship between a given type-factor loading and the presence or 
absence of each of the symptoms. A catescale was established for each type-factor, 
taken one at a time, as follows: 


1. Ss were ranked from low to high by factor loadings, and became the ordinate on the scaling
frame.
2. Symptoms dichotomized into present or absent became the abscissa on the scaling frame in
their chance order of arrangement in the rating scale.
3. Pins were inserted for a person in each cell of his designated row to correspond to the presence
of that symptom represented by each particular column.
4. Non-scalable symptoms were rejected by removing their column slats from the frame. Some
were retained as reciprocal categories“.
5. The remaining column slats were placed in ascending order by mean rank on the ordinate
scale in order to approximate the intensive mathematical model (identity matrix). In this
state, the scaling frame constitutes an associational surface expressing the relationship between
categories and type-factor loading.
6. A tabulation of the categories and their mean ranks now constitute the derived catescale.
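
The ranking and ordering steps above can be sketched in code. This is a simplified illustration, not du Mas's full procedure: it ranks subjects by loading, takes the mean rank of the subjects showing each symptom, and sorts the symptoms by that mean rank. The rejection of non-scalable symptoms and the handling of tied loadings are not modeled, and the loadings and symptom assignments are hypothetical.

```python
def catescale(loadings, symptoms):
    """Return (symptom, mean rank) pairs in ascending order of mean rank.

    loadings: subject -> type-factor loading
    symptoms: symptom name -> set of subjects in whom the symptom is present
    """
    ordered_subjects = sorted(loadings, key=loadings.get)           # low to high
    rank = {subject: position + 1 for position, subject in enumerate(ordered_subjects)}
    mean_rank = {
        name: sum(rank[s] for s in present) / len(present)
        for name, present in symptoms.items()
        if present  # a symptom shown by no one has no mean rank
    }
    return sorted(mean_rank.items(), key=lambda item: item[1])

# Hypothetical loadings for four subjects and three dichotomized symptoms.
loadings = {"A": -0.2, "B": 0.1, "C": 0.5, "D": 0.8}
symptoms = {
    "Very cooperative": {"C", "D"},
    "Generally hostile": {"A", "B"},
    "Assertive": {"A", "D"},
}
scale = catescale(loadings, symptoms)
```

A symptom whose mean rank sits near the top of the ordering is one shown mainly by subjects with high loadings on the type-factor, which is the sense in which the tabulated scale values describe a type.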


RESULTS 


Estimated communality was 70.7 per cent of the total variance, thereby indicating
that most of the symptom variance was identifiable in terms of the ob-


TABLE 1. MSA SCALE RELATING SYMPTOM-CATEGORIES TO TYPE-FACTOR:
NORMAL (COEFFICIENT OF REPRODUCIBILITY = .86)*

Scale Value   Brief Description of Symptom and Item Number

    315       Very aware of other patients (77)
    313       Very cooperative (68)
    310       No obsessive thoughts (21−)
    303       Usually relevant (1)
    303       Not opinionated (36)
    280       Demands excessive attention (67)
    239       Can verbalize hostility toward patients (76)
    234       Sloppy appearance (63−)
    223       Seldom looks at E. (57−)
    201       May dislike being told what to do (79)
    189       Persecutory feelings (17)
    189       Seeks little reassurance (49−)
    176       Argumentative (52)
    176       May complain about others (69)
    172       Influenced by persecutors (6)
    168       Meaningless phrases (40)
    160       Bizarre thoughts (30)
    154       Irrelevant (1−)
    140       Incoherent (45−)
    130       Generally hostile (7)
    128       Complains about locked doors (59)
    126       Grandiose (13)
    123       Some resentment of questions (46)
    116       Assertive (22)
     88       Stubborn about getting own way (53)

*Cutting line of items arbitrarily determined as identifying the positive
end of the type factor is set at 290.

TABLE 2. MSA SCALE RELATING SYMPTOM-CATEGORIES TO TYPE-FACTOR:
CATATONIC WITHDRAWN VS. OVER-EXPRESSIVE
(COEFFICIENT OF REPRODUCIBILITY = .94)*

Scale Value Brief Description of Symptom and Item Number 


154 Mannerisms or gestures (5) 
142 Some positioning (2) 
121 Slowed movements (20 —) 
117 Place disorientation (16 —) 
93 Sensitive and considerate (24) 
92 Undertalkative (62—) 
88 No gesticulation (47 —) 
87 Time disorientation (3) 
77 Some fear of doing wrong (37) 
45 Very anxious in new situation (38) 
44 Seldom looks at E. (57—) 
—76 Very cooperative (68) 
—oe Optimistic (28) 
—89 No restriction in interests (65) 
—94 Influenced by persecutors (6) 
—103 Bizarre thoughts (30) 
—103 Can show initiative (78) 
—104 Emotionally labile (12) 
Recognizes illness (34) 
Initiates greetings to personnel (74) 
Overactive (23) 
Not perplexed (19 —) 
Very aware of other patients (77) 
Initiates conversation with personnel (70) 
Always responds promptly (73) 
Not undertalkative (62) 
Assertive (22) 
Generally hostile (7) 
Generally initiates conversation (64) 
Overtalkative (4) 


*Cutting line arbitrarily set at 100. 


TABLE 3. MSA SCALE RELATING SYMPTOM-CATEGORIES TO TYPE-FACTOR:
PARANOID (COEFFICIENT OF REPRODUCIBILITY = .78)*


Scale Value Brief Description of Symptom and Item Number 





Bizarre thoughts (30) 
Generally hostile (7) 
Influenced by persecutors (6) 
Can show hostility to personnel (71) 
Can verbalize hostility toward patients (76) 
Grandiose (13) 
Overtalkative (4) 
Stilted speech (39) 
Concern with abstractions (31) 
Meaningless phrases (40) 
Brags (51) 
Always responds promptly (73) 
May complain about others (69) 
Overconcerned about leaving (55) 
Optimistic (28) 
Underactive (60—) 
Not concerned with abstractions (31 —) 
No stilted speech (39 —) 
Expresses despondency (41) 
Seldom looks at E. (57—) 
Time disoriented (3) 
Sloppy appearance (63 —) 

141 Place disorientation (16 —) 

113 Moves hands manneristically (42) 


*Cutting line arbitrarily set at 360. 
TABLE 4. MSA SCALE RELATING SYMPTOM-CATEGORIES TO TYPE-FACTOR: 
RESISTIVE ISOLATION (COEFFICIENT OF REPRODUCIBILITY = .83)* 


Scale Value Brief Description of Symptom and Item Number 


342 May dislike being told what to do (79) 

327 Underactive (60—) 

320 Social avoidance (66—) 

316 Not dependable (80—) 

292 Mannerisms or gestures (5) 

276 Sensitive and considerate (24) 

240 Some fear of doing wrong (37) 
Very cooperative (68) 
Recognizes illness (34) 
Dependable (80) 
Can verbalize hostility toward patients (76) 
Some fear of doom (27) 
Overconcerned about leaving (55) 
Expresses despondency (41) 
Initiates conversation with personnel (70) 
Always responds promptly (73) 
Time disorientation (3) 
Generally initiates conversation (64) 
Some resentment of questions (46) 
May complain about others (69) 
Criticism of hospital (56) 
Irritable (25) 
Overtalkative (4) 
Assertive (22) 
Generally hostile (7) 
Overactive (23) 
Stubborn about getting own way (53) 


*Cutting line arbitrarily set at 300. 


TABLE 5. MSA SCALE RELATING SYMPTOM-CATEGORIES TO TYPE-FACTOR: 
ANXIOUS-DYSPHORIC (COEFFICIENT OF REPRODUCIBILITY = .90)* 


Scale Value Brief Description of Symptom and Item Number 


368 Expresses despondency (41) 
Recognizes illness (34) 
Overconcerned about other’s opinion (29) 
Some fear of doom (27) 
Difficult to make decisions (35) 

No silly smiling (44—) 
Sensitive and considerate (24) 
Residual ability not low (33) 
Not perplexed (19 —) 

Social avoidance (66—) 

Brags (51) 

207 Stilted speech (39) 

207 Opinionated (36) 

191 Irrelevant (1—) 

184 Generally hostile (7) 

183 Generally initiates conversation (64) 

176 Overtalkative (4) 

170 Complains about locked doors (59) 

169 Grandiose (13) 

167 Optimistic (28) 

163 Can show hostility to personnel (71) 

159 Incoherent (45 —) 

150 Demands excessive attention (67) 

148 Silly smiling (44) 

142 Meaningless phrases (40) 

140 Stubborn about getting own way (53) 

134 Assertive (22) 

116 Sloppy in appearance (63 —) 





*Cutting line arbitrarily set at 310 
tained type-factors. The factors accounted for 96.8 per cent of the estimated com- 
munality. The factor matrix is omitted to conserve space since there is little value 
in knowing that a particular individual had the obtained loadings on the factors.* 
It will suffice to know that all factors were well represented by having many heavily 
loaded subjects for each. Only the second factor was bipolar, and strongly so. 

The types are described in Tables 1 through 5 in terms of the manifest structure 
analysis (MSA) relationship of symptom-categories to type-factor loadings. The 
mean score value for each relevant symptom-category is given in the tables.³ Thus, 
a mean of the weighted values for the symptoms encountered in a patient over a 
particular MSA scale provides an index of his factor loading for that type. The mean 
scores can be converted to type-factor loading estimates from tables.?, Attempts to 
reproduce the factor loadings on a particular type-factor from symptom occurrences 
for the Ss in this study led to the rank-difference correlations of reproducibility 
found at the heading of each table. 
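
The rank-difference correlation mentioned above is, on the usual reading, Spearman's coefficient computed between the obtained type-factor loadings and the MSA mean scores. A minimal sketch follows. The five loadings and mean scores are invented for illustration and are not data from this study, and the function names are hypothetical.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def rank_difference_correlation(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks.

    Without ties this equals 1 - 6 * sum(d**2) / (n * (n**2 - 1)).
    """
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sqrt(sum((a - mx) ** 2 for a in rx))
    sy = sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)

# Hypothetical subjects: obtained loadings and MSA mean scores on one type.
factor_loadings = [0.77, 0.10, 0.45, -0.20, 0.60]
msa_mean_scores = [330, 250, 240, 180, 300]
rho = rank_difference_correlation(factor_loadings, msa_mean_scores)
print(round(rho, 2))
```

Only the rank order of the two series enters the coefficient, which suits catescale values that are described as points on an arbitrary rank-order scale.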

Item identification numbers followed by a minus in the first five tables indicate 
that the items were reversed in direction during scaling (reciprocal categories, 
p. 145). 
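
The scoring rule described above (a mean of the scale values for the categories encountered in a patient) can be sketched with the rows of Table 4 whose scale values are legible. Two points are assumptions made for the illustration rather than statements from the text: a category marked with a minus is counted when the dichotomized item is absent, and every item is taken to have been rated. The patient and the function name are hypothetical.

```python
# (item number, reversed?) -> scale value, legible rows of Table 4 only.
RESISTIVE_ISOLATION = {
    (79, False): 342,  # May dislike being told what to do
    (60, True): 327,   # Underactive (60-)
    (66, True): 320,   # Social avoidance (66-)
    (80, True): 316,   # Not dependable (80-)
    (5, False): 292,   # Mannerisms or gestures
    (24, False): 276,  # Sensitive and considerate
    (37, False): 240,  # Some fear of doing wrong
}

def mean_catescale_score(catescale, items_present):
    """Mean scale value over the categories the patient falls into.

    Assumption: a reversed (minus) category applies when its item is NOT
    among the items rated present; an ordinary category applies when it is.
    """
    hits = [value for (item, is_reversed), value in catescale.items()
            if (item in items_present) != is_reversed]
    return sum(hits) / len(hits) if hits else None

# Hypothetical patient: items 79, 5 and 66 rated present, all others absent.
score = mean_catescale_score(RESISTIVE_ISOLATION, {79, 5, 66})
print(score)
```

The resulting mean would still have to be converted to a type-factor loading estimate from the conversion tables, which are not reproduced here.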


DISCUSSION 

Before referring to the tables of catescales it will be helpful to consider some 
of their features that are quite different from tables of factor loadings, thus mini- 
mizing likely misinterpretations. Unlike factor loading values, the MSA catescale 
values are simply points on an arbitrary rank-order scale; they have no zero point 
and do not represent the degree of variance of the category associated with the 
criterion. Actually, a category with a high scale value may show less variance asso- 
ciated with the criterion than those categories with mid-range scale values. The 
only implication of a catescale value is that it is the mean point on a rank-order 
scale indicating the type-factor loading associated with it. If the catescale value is 
very high, then individuals with very high type-factor loadings are ‘‘plus’’ scorers 
for that category. But note that the scaling requirements of minimum scatter (for 
the intensive model) around the stated scale point result in the rejection of many 
categories with high scale values because the scatter extending well below the middle 
range of type-factor scores makes them unreliable indicators of the mean scale value. 
In practical terms, this means that such a prominent symptom as “‘suspicious’’ may 
be absent from a paranoid scale not because it is usually not encountered in in- 
dividuals of that factor-type but because it does not discriminate between those in 
the midrange and those high on the factor-type. This is quite different from factor 
analysis where the absence of an item from a list of high loadings implies that it is 
not associated with that factor. Categories not included in the catescales given here 
may prove to be very good differentiae between catescales, but in the MSA pro- 
cedure they are selected only because they are good indicators of the magnitude of 
the particular type-factor for the subjects. 

Also, unlike factor loadings, the size of the catescale values is independent of 
the reliability of the item. While a low factor loading may be obtained for a factor- 
ially pure, but unreliable, item, this does not happen in MSA. Throughout the 
catescale range, items are of comparable reliability, and items of low reliability have 
automatically been rejected because they will not scale. For this reason, the re- 
liability of any category, representing the degree of type-factor loading, is inde- 
pendent of its catescale position. The big value of MSA lies herein, that we do not 
have to depend upon a limited number of reliable categories at the top end of the 
type-factor scale to designate the magnitude of a person’s factor loading. 

Yet our habit of looking at higher values in factor matrices to determine the 
nature of the underlying factor is most applicable to MSA. In fact, we must look at 

³Mean score values for categories given in the tables are mean ranks on the Y axis of the scaling 
frame multiplied by 10, except for Table 2, where 260 was subtracted from each value to indicate the 
true zero point. 
only the very highest scale values in a catescale to identify it since those below it are 
lower in scale value because of the intrusion of other factors. They cannot be factor- 
ially pure items whose loadings have been lowered merely because of poor reliability, 
as is often the case with items in factor matrices. It follows that identification of 
factors is more difficult with catescales than for factor matrices. With these pre- 
cautions in mind we can turn to a discussion of the catescales obtained. 


The Normal type-factor seen in Table 1 shows that the five highest (first-listed) items reflect 
a healthy, flexible adjustment. The hypothetical normal “subject” had a .77 factor loading on 
this type. As we go down the category scale in Table 1 we note the intrusion of paranoid features 
to destroy the purity of typing for subjects. It is not surprising that this factor and the paranoid 
type-factor showed a correlation of .43, the largest in Table 6.² This factor corresponds closely to 
the normal factor found in an earlier study of a similar sample of Ss. 


The Catatonic Withdrawn vs. Over-Expressive bipolar type-factor seen in Table 2 shows the 
three highest items to be very prominent symptoms of catatonic stupor. Disorientation is also 
important. Near the zero point (low scale values) the picture of the sensitive catatonic 
personality remains but the extreme withdrawal has disappeared and there is a close correspondence 
to the anxious-dysphoric type. Moving down toward the negative end of the scale, some paranoid 
features appear. Continuing further down, contact with reality becomes hyperacute and hostility 
and over-expressiveness become marked. The extreme negative end of the scale may represent 
the excited catatonic condition but the data do not provide clear information on this point. A 
similar factor appeared in earlier studies. 


The Paranoid type-factor seen in Table 3 shows the seven highest items to be the outstanding 
characteristics of paranoid schizophrenics. Egocentricity, hostility, overexpressiveness and 
delusional thinking are all prominent. Only at the low end of the scale do regressive and 
withdrawal symptoms appear. The factor appears to represent a fairly pure type at the higher end, 
and the degree to which a paranoid individual would conform to the type would depend upon 
the degree to which the paranoid symptoms are present rather than through an intrusion of other 
type-factor symptoms. Of course, as there is a remission of paranoid symptoms, the individual 
would revert to the normal type, but, more frequently, there is a residual of character 
problems and he more closely resembles the anxious-dysphoric type. A similar factor was found by 
Guertin. 

The Resistive Isolation type-factor seen in Table 4 shows the four highest items to be a 
combination of inactivity and a resistance to any attempts to stimulate activity in these 
individuals. The picture changes very quickly as we go down the scale and the resistiveness is 
replaced by cooperation and the individuals appear to be sensitive and have insight. At that point, 
the individuals have many of the features of the normal and anxious-dysphoric type-factors, so 
it is not surprising to find that these three types have moderately high intercorrelations. The 
existence of this factor is confirmed in two earlier studies. 

The Anxious-Dysphoric type-factor seen in Table 5 shows that the five highest items reflect 
a combination of insecurity and sensitivity. Anxiety and tension are present which lead to 
subjective feelings of unhappiness. Conflict and perhaps guilt are present. The high end of the scale 
discloses no clearly psychotic items and, except for its neurotic flavor, is similar to the normal type. 
The problems appear to be neurotic ones and in our population of psychotic patients this probably 
represents a remission of psychotic symptoms with the persistence of general inadequacy and 
conflict. In the middle of the scale, paranoid thought disturbance and hostility intrude to 
produce a mixture of neurotic and psychotic symptomatology or a pseudoneurotic picture. A similar 
factor appeared in the earlier study. 


METHODOLOGICAL DISCUSSION 


The data presented herein are intended to illustrate the methodology. As in 
most pilot studies, there are many neglects that must be given proper attention in a 
more thorough investigation. Some of these are: no cross-validation of the type- 
factors or the MSA catescales, incomplete description of the rating scale used and 
no demonstration that other raters would view the symptoms in the same way as 
did this rater, sample restriction to chronic male veterans limiting the general ap- 
plicability of the types and catescales, as well as many less important weaknesses. 
Despite these lacks, the empirical findings may prove useful for research purposes 
with a sample similar to that employed. At any rate, no claim is made as to the 
establishment of a sound classificatory system. The whole task of differential 
diagnosis has been ignored in this study, but the methodology proposed and illus- 
trated herein does show promise. 

Three of the catescales showed lower coefficients of reproducibility than desired. 
The lowest value is .78, but upon cross-validation of the catescale it will probably 
become apparent that much of the variance in type-factor predicted by the catescale 
has been lost. There is hope for improving reproducibility and reliability by 
employing the full five points of the symptom rated, each as a separate category, instead of 
simply dichotomizing as in this study. 

In closing, it would be well to view the unique features of this combined factor 
analysis-MSA methodology in relation to other typological procedures. It possesses 
uniqueness of solution that is not characteristic of linkage analysis. The only other 
method leading to unique typing is Lazarsfeld’s latent structure analysis, which 
leads to the determination of and assignment of individuals to latent classes. It has 
had limited application because of its computational complexity, but more 
important, it is applicable only to unidimensional problems involving a very few 
categories and they are customarily dichotomized. In contrast to this, the present 
methodology is more generally applicable since it provides typing of individuals on 
multidimensional characteristics, can quickly screen hundreds of categories for 
relevance to typing, and is capable of employing continuous variables. 


SUMMARY 


The problems of sampling and describing subjects in studies of schizophrenics 
are discussed and typology is proposed as a necessary solution. Transposed factor 
analysis provides a unique solution to typing schizophrenics but there are many 
difficulties encountered in relating the symptoms to obtained type-factors since 
they disappear in the process of intercorrelating individuals. 
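
The remark that the symptoms disappear when individuals are intercorrelated can be illustrated with a small sketch of the transposed approach. The ratings below are invented, and the helper names are hypothetical; the factoring step itself is not shown.

```python
from math import sqrt

def pearson(u, v):
    """Product-moment correlation between two equal-length profiles."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    if su == 0 or sv == 0:
        raise ValueError("a profile with no variance cannot be correlated")
    return cov / (su * sv)

def person_correlations(ratings):
    """Correlate persons (rows) with one another across symptom items.

    The result is persons by persons, so the symptom items no longer
    appear in it. Type-factors extracted from this matrix carry loadings
    for individuals, not for symptoms.
    """
    n = len(ratings)
    return [[pearson(ratings[i], ratings[j]) for j in range(n)]
            for i in range(n)]

# Hypothetical ratings: 3 patients (rows) on 5 symptom items (columns).
ratings = [
    [5, 4, 1, 1, 2],
    [4, 5, 2, 1, 1],
    [1, 2, 5, 4, 4],
]
r = person_correlations(ratings)
print([[round(x, 2) for x in row] for row in r])
```

In this toy case the first two patients resemble each other and both are unlike the third, yet nothing in the matrix says which symptoms produce the resemblance. That gap is what the catescales are meant to fill.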

The present study employs manifest structure analysis to establish ‘‘catescales’’ 
of symptoms, so that the type-factors have a clear description that can be used for 
classificatory purposes through quantified methods. The combination of factor 
analysis and manifest structure analysis provides a methodology that can produce 
quantified classificatory criteria for establishing homogeneous and reproducible 
samples of schizophrenics. 

The five obtained type-factors are: the normal, the catatonic withdrawn vs. 
over-expressive, the paranoid, the resistive isolation, and the anxious-dysphoric types. 
Sampling limitations of this pilot study are discussed. 
VALIDITIES OF THREE VISUAL TESTS FOR DIFFERENTIATING 
ORGANICS FROM SCHIZOPHRENICS AND NORMALS 
BERT R. SAPPENFIELD AND ROBERT J. RIPKE 


Montana State University Montana State Hospital 


PROBLEM AND METHOD 


This study compared the relative effectiveness of the spiral aftereffect test and 
two other tests (a visual test and a visual-motor test) for differentiating three groups 
previously defined as organic, schizophrenic, and normal. 


Subjects. The organic Ss (21 men and 9 women, patients at Montana State 
Hospital) had medical diagnoses of chronic brain syndrome, without mental de- 
ficiency or psychosis (8 convulsive disorders, 6 with central nervous system syphilis, 
3 Korsakoff’s syndrome, 3 with cerebral arteriosclerosis, 6 with diseases of unknown 
or uncertain causes, and one each with senile brain disease, brain trauma, prenatal 
brain damage, and other cranial infection). The age range was from 22 to 70 years. 

The schizophrenic Ss (21 men and 9 women, patients at Montana State Hos- 
pital) were selected from a complete list of medically diagnosed schizophrenics, so 
that each S was matched with an S in the organic group on the variables of sex, age, 
and educational level. 

The normal Ss (21 men and 9 women) were hospital employees and other 
available Ss without known organic brain pathology or psychosis. Normal Ss were 
also matched with organic Ss on the variables of sex, age, and educational level. 


Test procedures. All test stimuli were illuminated by two 60-watt bulbs placed 
at distances of 1 ft. on either side. Each test was preceded by the general instruction: 
“This is a special eye test. Keep both eyes open.” Each S was given tests in the 
following order: (a) retinal rivalry, trial 1; (b) stereoscopic figure tracing, trial 1; 
(c) stereoscopic figure tracing, trial 2; (d) retinal rivalry, trial 2; (e) spiral aftereffect, 
with spirals presented in the order A, B, C. 

Spiral aftereffect test. Three 90° spirals mounted on 8-in. discs were rotated at 78 rpm. 
Spirals A and C appeared to expand while rotating, and induced contracting aftereffects; spiral 
B appeared to contract while rotating, and induced an expanding aftereffect. S was seated 4 ft. 
from the disc and instructed as follows: “Look at the center of the spiral, and don’t take your 
eyes away until you are told to do so.” After the spiral was rotating: “What do you see?” If no 
answer or an ambiguous answer was given: “Which way is the line moving?” Upon obtaining 
an answer: “Now watch closely; when I turn the machine off, I want you to tell me what the 
line does.” After 30 seconds, the disc was stopped, and E asked: “Now, what is the line doing?” 
S was given a positive score for each trial with each spiral on which he reported perceiving 
the expected aftereffect. 

Retinal rivalry test. The apparatus was a Brewster stereoscope mounted vertically above a 
platform on which the test stimulus was placed. The test stimulus consisted of two 2½ in. x 
3¼ in. rectangles mounted on a 4 x 6 card; the left-hand rectangle was red, the right-hand 
rectangle green; both rectangles were ruled with black lines, spaced approximately 3/8 in. apart, 
parallel to and including the diagonal from the lower outside corner to the upper inside corner. 
While S looked into the stereoscope, the retinal rivalry stimulus was inserted, with the instruction: 
“Tell me when the card you are viewing changes.” The score consisted of the total number of 
color and/or line changes reported during two 30-second periods. 

Stereoscopic figure-tracing test. The apparatus was the same as for the retinal rivalry test. 
The test stimulus consisted of a 4 x 6 card, on the left (or right) side of which a cross, a square, 
and a diamond had previously been drawn, arranged vertically in relation to one another. The 
cross consisted of two 1-in. perpendicular lines intersecting at the midpoints; the square measured 
1 in. on each side; the diagonals of the diamond measured approximately 1 1/4 in. and 3/4 in., 
respectively. The right (or left) side of the card was blank. The card was inserted under the 
stereoscope so that the figures appeared on the side of the dominant eye; and, while S looked into 
the stereoscope, instructions were given to “trace over these figures with your pencil.” Scoring 
was performed independently by three judges, using the scoring standards of Terman and 
Merrill (pp. 207-2 230-282); the assigned score for each figure on each trial depended on 
agreement between at least two of the three judges. Since, possibly due to extremely lenient scoring 
standards, only 11 Ss out of a total of 90 in the three groups failed to achieve passing scores on 
both trials with the cross, this figure was eliminated from consideration both as a subtest and as 
a contribution to total score. Scores consisted, then, of the number of successful reproductions, 
in two trials, of each figure (square and diamond) and of both figures combined. 
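
The scoring rule just described (agreement of at least two of three judges, with the cross dropped from the totals) can be sketched as follows. The pass/fail judgments are invented for one hypothetical S, and the function and key names are illustrative only.

```python
def agreed_score(judge_scores):
    """Return 1 if at least two of the three judges passed the tracing, else 0."""
    return 1 if sum(1 for s in judge_scores if s) >= 2 else 0

def figure_tracing_scores(judgments, excluded=("cross",)):
    """Count successful reproductions per figure over the trials.

    judgments maps a figure name to a list of per-trial judge tuples.
    Figures named in `excluded` are left out, as the cross was in the study.
    """
    scores = {
        figure: sum(agreed_score(trial) for trial in trials)
        for figure, trials in judgments.items()
        if figure not in excluded
    }
    scores["total"] = sum(scores.values())
    return scores

# Hypothetical S: each tuple holds the three judges' pass (1) / fail (0) marks.
judgments = {
    "cross": [(1, 1, 1), (1, 1, 1)],
    "square": [(1, 1, 0), (0, 0, 1)],
    "diamond": [(1, 1, 1), (1, 0, 1)],
}
scores = figure_tracing_scores(judgments)
print(scores)
```

With three judges and a pass/fail mark, a two-judge majority always exists, so no tie-breaking rule is needed.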







RESULTS 


The data of Table 1 indicate that each of the tests, and each of the subtests, 
discriminated between the organic group and the normal group, at beyond the .01 


TABLE 1. COMPARISONS OF ORGANICS WITH NORMALS ON ALL VARIABLES 


Normal Organic 
(N = 30) (N = 30) 


Variable 


Spiral aftereffect 

Spiral Aa 2 : 7.3 <.01 
Spiral Ba 2° : 7 <.001 
Spiral Ca ‘ <.01 
Total score f 00007 


Retinal rivalry' 2s < 001 
00003 


Stereoscopic figure tracing 
Figure B» ) : 9.08 c .01 
Figure C> : ¢ 8.07 <.01 
Total score 166.0 < .00003 


Note: Entries in second and third columns denote numbers of Ss giving successful or above- 
median performance. All Chi-squares corrected for continuity. 


ᵃDichotomized in terms of reporting or not reporting aftereffect. 
ᵇDichotomized as near as possible to median of both groups combined. 


level of significance. Comparison of the tests and subtests, in terms of their relative 
discriminating powers, reveals (a) that spiral B (having an expanding aftereffect) 
discriminated the two groups more successfully than did spirals A and C (having a 
contracting aftereffect), (b) that the retinal rivalry test discriminated the groups 
far more successfully than either of the other tests, and (c) that the stereoscopic 
figure-tracing test compared favorably with the spiral aftereffect test in its ability 
to discriminate between organics and normals. The total scores for each of the tests 
distinguished between the normal and organic groups, at extremely low probability 
levels (.00003 to .00007), as evaluated by the Mann-Whitney U test. 

Comparison of the normal group with the organic group (Table 1), in terms of 
the relative frequencies with which successful or above-median performance oc- 
curred, reveals that each spiral of the spiral aftereffect test owed its discriminating 
ability more to its success in classifying normal Ss correctly than to its success in 
classifying organic Ss correctly. The retinal rivalry test, however, was highly suc- 
cessful in classifying correctly the Ss in both groups. Figure B of the stereoscopic 
figure-tracing test was more successful in classifying organics than in classifying 
normals, while figure C showed moderate, but equivalent, success in classifying the 
Ss in both groups. 

The data of Table 2 indicate that two subtests (spirals A and C) of the spiral 
aftereffect test and both subtests of the stereoscopic figure-tracing test failed to 
discriminate significantly between the organic and schizophrenic groups, although 
the total scores for each of the tests, as evaluated by the Mann-Whitney U test, 
were successful in discriminating between the two groups at acceptable levels of 
significance. Spiral B of the spiral aftereffect test also discriminated between the 
schizophrenics and the organics, at a highly acceptable level of significance; again, 
however, its discriminating ability was attributable mainly to its success in 
classifying Ss in only one of the two groups, the schizophrenic group. The retinal rivalry 
test, on the other hand, misclassified only two schizophrenics and four organics, 
on the basis of a cutting score at the median for both groups combined. 
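
A cutting score at the median of both groups combined, and the misclassification count that follows from it, can be sketched as below. The ten scores are invented and are not the study's data. Counting a score as above-median only when it strictly exceeds the cut is an assumption, since the text says only that variables were dichotomized as near as possible to the median.

```python
from statistics import median

def split_at_combined_median(group_a, group_b):
    """Return the combined-median cut and the count above it in each group."""
    cut = median(list(group_a) + list(group_b))
    above_a = sum(1 for s in group_a if s > cut)
    above_b = sum(1 for s in group_b if s > cut)
    return cut, above_a, above_b

# Hypothetical retinal rivalry scores (number of reported changes).
schizophrenic = [12, 9, 15, 11, 8]
organic = [3, 5, 10, 2, 4]

cut, above_schiz, above_org = split_at_combined_median(schizophrenic, organic)

# If "above the cut" is read as schizophrenic and "at or below" as organic,
# the misclassified Ss are the low schizophrenics plus the high organics.
misclassified = (len(schizophrenic) - above_schiz) + above_org
print(cut, above_schiz, above_org, misclassified)
```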

Tabular data are omitted for the comparison of schizophrenics with normals, 
since schizophrenic Ss performed at levels approximating the performance of normal 
TABLE 2. COMPARISONS OF ORGANICS WITH SCHIZOPHRENICS ON ALL VARIABLES 


Schiz. Organic 
Variable (N = 30) (N = 30) 


Spiral aftereffect 
Spiral Aa 21 - ; 10<.20 
Spiral Ba 28 K { .0O1 
Spiral Ca 21 K 05<.10 


Total score .001 


Retinal rivalry' 28 - 35.4: <.001 
- .00003 


Stereoscopic figure tracing 

Figure B' > .10<.20 
} igure Ct ‘ y >. b.<.B 
Total score <.01 





Note: Entries in second and third columns denote numbers of Ss giving successful or above- 
median performance. All Chi-squares corrected for continuity. 


ᵃDichotomized in terms of reporting or not reporting aftereffect. 
ᵇDichotomized as near as possible to median of both groups combined. 


Ss on both the spiral aftereffect test and the stereoscopic figure-tracing test, both in 
terms of subtest scores and in terms of total scores. The only test which discrim- 
inated significantly between the schizophrenic and normal groups was the retinal 
rivalry test, on which 19 normals and 10 schizophrenics achieved above-median 
scores, when the median was based on the scores of the two groups combined. The 
resulting Chi-square value was 4.27 (significant at the .05 level), while the value of 
U was 248.5 (significant at the .002 level). 
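
The Chi-square of 4.27 can be checked from the counts given above (19 of 30 normals and 10 of 30 schizophrenics above the combined median), using the correction for continuity mentioned in the table notes. The sketch below uses the standard 2 x 2 formula with Yates' correction. The second call rests on a reading of the text rather than on legible tabled counts: two schizophrenics and four organics misclassified is taken to mean 28 and 4 Ss above the median, respectively.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for a 2 x 2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    corrected = max(abs(a * d - b * c) - n / 2, 0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * corrected ** 2 / denominator

# Normals vs. schizophrenics on retinal rivalry: above / not above the median.
normal_vs_schiz = yates_chi_square(19, 11, 10, 20)

# Schizophrenics vs. organics, counts inferred from the misclassifications.
schiz_vs_organic = yates_chi_square(28, 2, 4, 26)

print(round(normal_vs_schiz, 2), round(schiz_vs_organic, 2))
```

The first value reproduces the reported 4.27, and the second agrees with the Chi-square beginning 35.4 that is legible in the retinal rivalry row of Table 2.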


SUMMARY 


This study investigated the relative effectiveness of the spiral aftereffect test, 
a retinal rivalry test, and a stereoscopic figure-tracing test for discriminating be- 
tween groups previously defined as organic, schizophrenic, and normal. 

Each of the tests, and each of the subtests, discriminated between organics and 
normals at acceptable levels of significance. Total scores on each of the tests, but 
only one subtest (spiral aftereffect involving perception of expansion), also dis- 
criminated at acceptable levels of significance between organics and schizophrenics. 
The only test which discriminated significantly between schizophrenics and normals 
was the retinal rivalry test. The retinal rivalry test, as compared with the other two 
tests, was found to classify correctly much larger proportions of Ss in the different 
groups. 
THE USE OF DRAWINGS OF HANDS AS A PROJECTIVE MEDIUM FOR 
DIFFERENTIATING NORMALS AND SCHIZOPHRENICS 


EDWIN E. WAGNER 


University of Akron 


PROBLEM AND METHOD 


Recent work in the prediction of anti-social 
behavior has suggested that drawings of human 
hands may serve as useful projective devices for 
assessing personality.¹ This exploratory study 
was concerned with the possibility of 
differentiating a “known” clinical group, 
schizophrenia, from a normal population utilizing this 
new projective approach. 

The subjects were 50 diagnosed male 
schizophrenics randomly selected from Massilon State 
Hospital, Massilon, Ohio, and 50 male college 
students from Akron University. The mean age 
of the schizophrenic group was 42.5, S.D. 17.6. 
The normal group had a mean age of 20, S.D. 
1.3. The schizophrenics were diagnosed as 10 
chronics; 8 undifferentiated; 6 simple; 5 
catatonic; 7 hebephrenic; 20 paranoid. 

The stimuli consisted of 10 cards 
approximately 4 by 5 inches. On each of these cards except 
the last a hand was drawn in a recognizable but 
ambiguous pose. Each subject was asked to 
“tell what the hand is doing”. The last card was 
blank and the subjects were asked to “imagine a 
hand doing something”. The responses were 
recorded verbatim by the examiner. The 
responses were analyzed and scored as follows: 


1. An interpersonal response: another person 
is explicitly or implicitly involved in the 
action, e.g., “shaking hands”, “holding up 
traffic”, “fighting someone”, “talking”, 
“dancing”. 

2. An inactive response: hand is resting or 
posed, little or no energy is exerted with 
attendant lack of interpersonal and/or 
environmental accomplishment, e.g., “hand resting on 
a table”, “holding fingernails up to dry”, 
“sleeping”. 

3. An active response: movement of a 
non-interpersonal nature, e.g., “picking up a coin”, 
“reaching for something”, “throwing a ball”. 

4. A maladjustive response: tension, pain, 
disfigurement or despair is projected into the 
response, e.g., “all clenched up”, “crippled”, 
“got sores on it”, “drowning”, “paralyzed”, 
“dead”, “hanging from a ledge”. 

5. A withdrawal response: an inability to 
respond meaningfully to the card which 
presumedly indicates some degree of pathological 
withdrawal from interpersonal and/or 
environmental situations. Withdrawal responses can 
be subdivided into Description, Failure and 
Bizarre responses. 
a. Description: the presence of the hand 
is acknowledged without imparting a 
meaningful action tendency, e.g., “it’s 
a hand”, “a left hand”, “a hand with 
five fingers”. 


¹As reported by Z. A. Piotrowski at the 
International Congress of Psychology in Bonn, 
Germany, and reviewed in Science Newsletter for 
August 13, 1960, page 109. 


TABLE 1. DISTRIBUTION OF RESPONSES, SIGNIFICANCE LEVELS AND CORRELATIONAL VALUES FOR 
THE NORMAL AND SCHIZOPHRENIC GROUPS ON EACH SCORING CATEGORY 


Number of Individuals in Each Group Producing a Given Number 


Number of 
Responses 
per 
Protocol 


Inactive 
Norm 


Interpersonal 
Norm. Schiz. 


24 
8 
9 
6 
9 


One 


> 


0 
l 
0 
0 
0 
0 


one 


oon 


Total 50 


50 


<.001 


.56 


Median Test 


Phi Values 


Schiz. 


of Responses Per Protocol for Each Scoring Category 


Active 
Norm. Schiz. 


Withdrawal 
Norm. Schiz. 


Maladjustive 
Norm. Schiz. 


0 19 é 36 36 i) 
y 5 ] 


6 9 2 3 
9 7 K : 11 
10 6 ; 6 
12 7 l ‘ ) 
9 0 

4 0 

0 0 

0 ; 
0 
0 
b. Failure: subject is unable to respond 
at all to a particular drawing. 

c. Bizarre: a response completely out of 
keeping with the drawn contours of the 
hand, e.g., “a big black bug creeping 
along the wall”, “the hand of God with 
the tree of life growing out of it”. 


The above categories were predetermined by 
the author because of what seemed to be their 
logical a priori projective connotations and the 
simplicity of scoring. As a control, the protocols 
were administered and scored by a graduate 
student, Captain Ford Smart, U.S.A.F., who 
had no knowledge of the interpretive significance 
of the scoring categories. For each subject, a tally 
was made of the number of responses produced 
in each category. Median tests for significance 
and Phi tests of correlation were then computed 
between the experimental and control groups for 
each major scoring category. Since there was an 
obvious disparity between the ages of the two 
groups, median tests were also computed to 
establish the significance, if any, between age and 
each diagnostic scoring category for the 
schizophrenic group. This was not done for the normal 
group because of the similarity in ages. 


RESULTS AND DISCUSSION 
There were no significant differences on the 
median test between age and scoring categories 
for the schizophrenic group, suggesting that any 
differences between the experimental and control 
group were attributable to factors other than age. 
Table 1 summarizes the distribution of responses 
on each scoring category for the schizophrenic 
and normal groups with significance levels and 
correlational values. All scoring categories were 
significant beyond the .001 level with the 
exception of the inactive category. Maximum 
differentiation was obtained from the withdrawal 
category which produced a correlation of .81 between 
the presence of withdrawal responses and 
schizophrenia. 
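
A Phi value of the kind reported above is the product-moment correlation for a 2 x 2 table. The sketch below uses invented counts for two groups of 50, since the cell entries of Table 1 are not legible here; it is an illustration of the coefficient and not a reconstruction of the withdrawal result.

```python
from math import sqrt

def phi_coefficient(a, b, c, d):
    """Phi for a 2 x 2 table [[a, b], [c, d]].

    Rows are the two groups; columns are presence and absence of the sign.
    """
    denominator = sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        raise ValueError("a row or column total is zero")
    return (a * d - b * c) / denominator

# Hypothetical counts: 45 of 50 schizophrenics and 5 of 50 normals
# give at least one response in the category.
phi = phi_coefficient(45, 5, 5, 45)

# Phi squared times N equals the uncorrected Chi-square for the same table.
chi_square = phi ** 2 * 100
print(round(phi, 2), round(chi_square, 1))
```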

Apparently, the use of hand drawings as a pro- 
jective medium for differential diagnosis should 
be pursued further on an experimental basis since, 
notwithstanding the obvious limitations of this 
study, significant differentiation has been ob- 
tained between the normal and schizophrenic 
groups. If the connotations of the scoring cate- 
gories are taken literally, a picture of the schizo- 
phrenic emerges which is in keeping with what is 
known about their overt behavior: reduction in 
interpersonal relationships, lowering of activity 
level, maladjustive behavior, and withdrawal 
from reality situations. The average administra- 
tion time in this experiment was only about five 
to ten minutes per subject. 


SUMMARY 
Significant differentiation was obtained be- 
tween a normal and schizophrenic group using 
drawings of hands as an experimental projective 
device. 


SHORT FORMS OF THE WISC FOR USE WITH EMOTIONALLY 
DISTURBED CHILDREN¹ 
RICHARD ENBURG’, VINTON N. ROWLEY, AND BETH STONE 


State University of Iowa 


In a clinical setting, the psychologist frequently is faced with the problem of 
obtaining the maximum amount of information in a minimum amount of time. Often 
information is desired regarding the intellectual functioning of the child as well as 
his personality dynamics. Particularly with the emotionally disturbed child, a 
question about his intellectual ability may arise, as the presenting problem 
frequently centers on his inability to adjust in the school situation. This raises the 
problem of determining whether the child is having academic difficulty associated 
with intellectual factors, emotional factors or both. If consideration of time enters 
into the selection of testing materials, the psychologist must select a relatively brief 
test of intellectual functioning in order to devote more time to personality 
evaluation. 


¹The authors are indebted to the staff of the Massachusetts Institute of Technology Computation 
Center and International Business Machines for their assistance in processing the data for this study. 
²Now at University of Connecticut. 
The Wechsler Intelligence Scale for Children (WISC) is particularly adaptable 
to this type of problem, as the clinician may select only particular subtests for ad- 
ministration. However, when only selected subtests are administered, the reliability 
of the prorated quotient is questionable. Several approaches have been used to de- 
termine the reliability of such shortened batteries. One method is to determine re- 
liability as a function of the correlation between the subtests utilized and the Full 
Scale intelligence quotient. Bridges published a nomograph based upon the intercorrelations 
between the subtests and the intelligence quotient of Wechsler’s standardization 
sample. His graphs furnish estimates of the reliability of various combinations 
of subtests. As he mentions, the graphs are applicable only to the population 
on which the scale was standardized. 

Another approach to developing short forms of the WISC involves preselecting 
certain combinations of subtests and determining the correlations with the Full Scale 
quotient. Carleton and Stacey obtained correlations ranging from .64 to .88 for 21 
such combinations. Their sample consisted of children evaluated for possible mental 
deficiency, without organic impairment. The applicability of these short forms for 
emotionally disturbed children is somewhat questionable in light of the findings of 
Yalowitz and Armstrong. Utilizing various combinations of five subtests, they 
obtained correlations in the order of .55 to .61 between the abbreviated scale and the 
complete examination of children referred to a child guidance clinic. Another investigation 
by Simpson and Bridges found that the sum of Vocabulary and Block 
Design correlated .874 with the total weighted score for child guidance cases. In the 
above studies the short forms investigated have been selected either on the basis of 
the original correlations obtained by Wechsler or the investigator’s preference for 
certain subtests—either of which may have introduced bias. 

An alternative approach to evaluating intelligence is to employ some instru- 
ment which is shorter or more convenient to administer than the entire WISC, yet 
which correlates well with the Full Scale quotient. Among the tests investigated 
have been the Kent EGY, Raven’s Progressive Matrices (RPM), Raven’s Colored 
Progressive Matrices (RCPM), the SRA Primary Mental Abilities Test, the Cali- 
fornia Test of Mental Maturity (CTMM), and the Columbia Mental Maturity Scale 
(CMMS). Barratt found that the RPM correlated .754 with the WISC Full Scale 
quotient while the CMMS had a correlation of .606. On this basis he proposed that 
the RPM be used as an alternative to the WISC rather than the CMMS. Martin 
and Wiechers found that the RCPM correlated .91 with the Full Scale quotient of 
the WISC. Both of the above studies were based on a sample of school children. 
Stacey and Carleton obtained a correlation of .55 between the RCPM and the 
WISC Full Scale quotient with children evaluated for possible mental deficiency. 
Stemple, using children of superior social-economic level, found a correlation of 
.58 between the WISC Full Scale quotient and the SRA quotient. Delp obtained 
a correlation of .618 between the WISC Full Scale quotient and the Kent EGY. He 
concluded that this scale should not be utilized as an evaluative instrument in prefer- 
ence to the subtests of the WISC as they had better correlations with the Full Scale 
quotient than the Kent EGY. 

In the above studies the subjects were not classifiable as being emotionally dis- 
turbed children. Since the degree of comparability among emotionally disturbed 
children, school children and mentally deficient children is unknown, it should not 
be assumed that findings based on one group can be generalized to another. In addi- 
tion there is the problem of relative efficiency. If some combination of subtests from 
the WISC can be found which correlates as well as or better than the above alternative 
tests, while not requiring more time, it would appear that the shortened WISC, being 
more familiar to the clinician, would be a more appropriate instrument. The purpose 
of the present investigation was to derive short forms of the WISC such that (a) 
administration time could be lessened without appreciably reducing the reliability, 
and (b) previous sources of possible bias from preselection of subtests or utilization 
of Wechsler’s original sample would be eliminated. 
METHOD 


The population for this study consisted of children referred to the Child Psychiatry 
Service, Psychopathic Hospital, State University of Iowa, during the years 
1957 to 1960. All cases were selected to meet the criteria of being without a primary 
diagnosis of organic involvement or mental deficiency (defined as having a Full Scale 
quotient of less than 70 on the WISC) and with some form of emotional difficulty. 
The mean age was 11.29 years, range 5-7 to 15-11; both males (N = 118) and females 
(N = 27) were included for a total sample of 145 cases. All Full Scale quotients were 
computed on the basis of ten subscales, with Mazes and Digit Span being omitted. 

The basic information sought was a series of correlations between various com- 
binations of the subtests of the WISC and the total weighted score. Since one con- 
sideration for developing such a shortened battery was a minimization of time in- 
volved in administration, it was decided to evaluate only combinations of three, four 
and five subtests. In order to avoid any investigator preference in regard to the 
selection of the various combinations, it was decided to form all the possible different 
combinations of subtests for a given size of the short form. Thus for a short form 
which includes only three subtests, there are 120 possible different combinations. 
For the four-subtest form there are 210 possible combinations, and for the five-subtest 
form 252 combinations are possible. The above information involved forming 
the sum of the scaled scores for the various possible combinations and then correlating 
these with the total scaled sum for the entire battery, resulting in 582 correlations. 
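
The counts above follow from choosing three, four or five of the ten administered subtests, and the procedure can be sketched in a few lines of Python. This is only an illustrative sketch: the scores are randomly generated stand-ins, not the study's data, and the abbreviation "I" for Information and the helper names `pearson_r` and `short_form_correlations` are assumptions introduced here.

```python
from itertools import combinations
from math import comb, sqrt
import random

# Ten subtests used for the Full Scale (Mazes and Digit Span omitted).
# "I" for Information is an assumed abbreviation.
SUBTESTS = ["I", "C", "A", "S", "V", "PC", "PA", "BD", "OA", "Cod"]


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def short_form_correlations(children, sizes=(3, 4, 5)):
    """Correlate every k-subtest sum with the ten-subtest sum."""
    full = [sum(child[s] for s in SUBTESTS) for child in children]
    results = {}
    for k in sizes:
        for combo in combinations(SUBTESTS, k):
            part = [sum(child[s] for s in combo) for child in children]
            results[combo] = pearson_r(part, full)
    return results


# Hypothetical scaled scores for 145 children (placeholder data only).
rng = random.Random(0)
children = [{s: rng.randint(1, 19) for s in SUBTESTS} for _ in range(145)]

counts = {k: comb(len(SUBTESTS), k) for k in (3, 4, 5)}
results = short_form_correlations(children)
print(counts, len(results))

# The ten highest-correlating five-subtest forms, as in a "top ten" table.
five = {c: r for c, r in results.items() if len(c) == 5}
for combo in sorted(five, key=five.get, reverse=True)[:10]:
    print(", ".join(combo), round(five[combo], 2))
```

Enumerating every combination, rather than a hand-picked few, is what removes investigator preference from the selection.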

In utilizing the above procedure, two possible factors which might bias the re- 
sults were avoided. The first is the assumption that the correlations between the 
subtests and the Full Scale quotient would be the same for a population of emotion- 
ally disturbed children as was found for Wechsler’s standardization sample. Second- 
ly, preferential selection of combinations of subtests is avoided. 


RESULTS 
The mean Full Scale IQ of the present sample was 97.18 (SD = 15.41), the mean 
Verbal Scale IQ was 97.26 (SD = 16.08), and the mean Performance Scale IQ was 
97.61 (SD = 15.54). The differences between the means were obviously not significant. 
The sum of the Verbal scaled scores correlated .90 with the sum of the Full 
scaled scores, while the sum of the Performance scaled scores correlated .85 with the 
sum of the Full scaled scores. These results are comparable to those reported by 
Wechsler for his standardization sample. 

TABLE 1. CORRELATIONS BETWEEN COMBINATIONS OF THREE SUBTESTS AND THE SUM OF FULL SCALE SCALED SCORES 

Subtests (partial) 
..., BD 
..., A, BD 
..., BD 

[Remaining entries of Table 1 are illegible in the source.] 

The correlations of three, four and five subtests ranged in value from .77 (PC, 
OA, Cod) to .96 (C, A, S, BD, Cod). It is not surprising that when the number of 
subtests was increased the correlated values obtained were also enhanced. For 
example, PC, OA and Cod correlated .77, but with the addition of BD the correlation 
rose to .82. When PA was added to this combination of four subtests the correlation 
increased to .85. 

TABLE 2. CORRELATIONS BETWEEN COMBINATIONS OF FOUR SUBTESTS AND THE SUM OF FULL SCALE SCALED SCORES 

Subtests (partial)      R 
...                     .94 
...                     .94 
...                     .94 
...                     .94 
...                     .94 
..., BD, Cod            .94 
..., BD                 .94 
...                     .94 
...                     .94 
...                     .94 

[Most subtest labels in Table 2 are illegible in the source; "..." marks the missing entries.] 

The ten combinations of three, four and five subtests which correlated most 
highly with the sum of the weighted scores are presented in Tables 1, 2 and 3. Only 
ten combinations are listed for each size of the shortened test, since it is probable 
that they will be of most interest to the clinician. 


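TABLE 3. CORRELATIONS BETWEEN COMBINATIONS OF FIVE SUBTESTS AND THE SUM OF FULL SCALE SCALED SCORES 

Subtests (partial)      R 
..., BD, Cod            ... 
..., PA, BD             .96 
..., PC, OA             .96 
..., PC, BD             .95 
..., BD, Cod            .95 
..., PA, BD             .95 
..., V, PC, BD          .95 
..., BD, OA             ... 
..., S, PA, BD          ... 
..., A, PC, BD          ... 

[Leading subtest labels and some R values in Table 3 are illegible in the source; "..." marks the missing entries.] 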


DISCUSSION 

The results indicate that a few subtests of the WISC may be used to assess intellectual 
functioning without greatly reducing the reliability of the obtained quotient. 
In this respect, the results are comparable to those reported by Carleton and 
Stacey and by Simpson and Bridges, but are in contrast to those obtained by Yalowitz 
and Armstrong. This latter discrepancy may reflect sampling differences 
since their sample also included organically impaired and defective children. In addition, 
Yalowitz and Armstrong mentioned that their population showed considerable 
intra-test scatter and a larger standard deviation on the Performance Scale than on 
the Verbal Scale, whereas the present study did not reveal similar variability. 

One possible objection to the use of any short form of the WISC is that the 
clinician loses diagnostic information regarding the child’s functioning on those subtests 
which are omitted. Although this information may be desirable in some instances, 
recent investigations by Cohen and Rowley cast some doubt on the 
validity of utilizing differential performance on the subtests as a diagnostic procedure. 
The present findings lend credence to the use of a few subtests to evaluate 
intellectual functioning of emotionally disturbed children. They do not, however, 
justify the use of a short form in cases where the child presents problems of organic 
involvement and/or mental retardation. In such cases, the results of Carleton and 
Stacey are more aptly applied. 
SUMMARY 


The purpose of the present investigation was to derive short forms of the 
WISC for use with emotionally disturbed children, such that administration time 
could be lessened without appreciably reducing reliability and previous sources of 
possible bias from preselection of subtests or the utilization of Wechsler’s original 
sample would be eliminated. All combinations of three, four and five subtests of the 
WISC were correlated with the sum of the scaled scores for the Full Scale of 145 
children whose diagnosis was emotional disturbance. It was concluded that selected 
subtest combinations could reliably estimate the intellectual functioning of emotion- 
ally disturbed children. 
THE CEILING OF THE NEW STANFORD-BINET 


WALLACE A. KENNEDY, WILLARD NELSON, RONALD LINDNER, 
HAROLD MOON AND JACK TURNER 


Florida State University 


PROBLEM 


The low ceiling of the 1937 Revision of the Stanford-Binet has been one of the 
major limitations encountered in its use with above-average adolescents and adults. 
In spite of the provision of an average adult and three superior adult levels 
above the fourteen-year level, it happens disturbingly often that subjects pass items 
at the upper limit of the test. The Third Revision of the Stanford-Binet Intelligence 
Test presents some improvements: (a) the extension of IQ tables to chronological 
age 18; (b) the selection of the more discriminating items from both previous 
forms and combining them into one new form; and (c) the use of standard scores at 
each of the chronological age levels, with a mean of 100 and an SD of 16. 
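
Item (c) describes a standard score rescaled to a mean of 100 and an SD of 16 within each age level. A minimal sketch of that general transformation follows; the function name `deviation_iq` and the illustrative age-level mean and SD are hypothetical, and the published IQ tables themselves are not reproduced here.

```python
def deviation_iq(score, age_mean, age_sd, mean=100.0, sd=16.0):
    """Rescale a score to a standard score with the given mean and SD.

    score, age_mean and age_sd are on the same scale for one
    chronological age level (hypothetical values in the example below).
    """
    if age_sd <= 0:
        raise ValueError("age_sd must be positive")
    z = (score - age_mean) / age_sd   # standing relative to age peers
    return mean + sd * z              # rescaled to mean 100, SD 16


# Hypothetical age-level norms: mean 50, SD 10.
print(deviation_iq(50, 50, 10))   # a score at the age mean
print(deviation_iq(60, 50, 10))   # one SD above the age mean
```

Because the rescaling is done separately at each age level, a given IQ denotes the same relative standing at every age.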
METHOD 


In the Summer of 1960, the authors had an opportunity to put the upper level 
of the 1960 Binet to a reasonable test, repeating, with minor modifications, a study 
of mathematically gifted adolescents. Twenty-nine eleventh-grade students participated 
in a Summer Mathematics Institute sponsored by Florida State University 
and the National Science Foundation. The program, which has been described by 
Nichols, is in its third year. Students were selected for the program on the basis of 
mathematical ability as determined by scores on the Eleventh Annual Mathematical 
Contest sponsored jointly by the Mathematical Association of America and the 
Society of Actuaries. For the most part the students were the top 29 entries; an 
exception was made to obtain the intended minimum of 6 girls. In two instances girls 
were selected instead of male competitors with higher scores. All but 3 of the 29 
selected had reached their sixteenth birthdate; these three were aged 15-0, 15-6, and 
15-9. 

The Sequential Test of Educational Progress, Mathematics Form, was admin- 
istered upon arrival of the students at the University as a verification of their 
selection test scores. The range on the STEP Test was from 62 to 98; mean, 85.9; 
SD, 8.4. In addition to the STEP Test, two further measures were obtained: teacher 
rankings based upon subjective evaluation, and a ranking from examination scores 
in two subject areas: Theory, and Sets and Functions. The correlations with the 
STEP Test were .79 for Theory rankings and .77 for Sets and Functions rankings. 

In the previous study, the WAIS and WISC rather than the 1937 Revision of 
the Binet were employed due to the upper limit restrictions imposed by the latter 
when administered to superior adolescents. Because of the age level differences of 
the 1959 Math Campers, both WISC and WAIS were administered despite their 
doubtful comparability. Consequently five of the fifteen-year-olds were administered 
both instruments as a reliability check and the reliability scores were found to be 
within the standard error of the test. Therefore, in the present study, in an attempt 
to evaluate the intended improvements of the 1960 Revision of the Stanford-Binet, 
the WAIS was administered in conjunction with Forms 1A and 1B of the STEP 
Test and the 1960 Revision of the Binet. It was hypothesized that the latter should 
have an extended upper limit. 


RESULTS 


Concerning the ceiling of the new S-B, it was found that all of the students 
passed some items at all age levels; that is, a maximal level could not be obtained on 
these students. Thus one of the original objections to the Binet still seems to be 
applicable. 


In spite of this objection, however, the concurrent validity of the new S-B appears 
to be high (Table 1). It correlates .52 with the Full Scale WAIS, .50 with the 
total STEP, .48 with the Theory rating, and .53 with the Sets and Functions ratings. 
These correlations were significant at the .01 level of confidence. The WAIS Verbal 
correlated .61 with the S-B while the Performance correlated only -.03. While it 
has been pointed out that the Wechsler Performance Scale fails to provide a 
correlation as high as the Verbal Scale with the S-B, it is interesting to note that in 
the present study no significant correlations were found between the Wechsler Performance 
Scale and any of the other measures obtained. Indeed, none of the correlations 
with this scale were above .12. But the results were very different with 
the Verbal Scale. The mean of the current sample on the Verbal Scale was 136.8, 
with an SD of 9.2. Thus the mean of 137.4 on the new S-B is very close to the mean 
on the Verbal Scale of the WAIS. 

TABLE 1. MEANS, SD’S AND CORRELATIONS BETWEEN MEASURES 

Measure                 Mean     SD 
WAIS 
  Full                  133.8    5.6 
  Verbal                136.8    9.2 
  Performance           125.1    ... 
STEP 
  Total                 85.9     8.4 
  A                     43...    ... 
  B                     42...    ... 
S-B (1960)              137.4    ... 
Theory                  ...      ... 
Sets and Functions      ...      ... 

[The correlation columns of Table 1 (WAIS Full, Verbal, Performance; STEP Total, A, ...) are largely illegible in the source; the surviving entries are .43*, .45** and .07 in the WAIS Full, Verbal and Performance rows respectively.] 

*p < .05. **p < .01. 


SUMMARY 
It appears that improvements have been made in ceiling characteristics of the 
1960 Stanford-Binet Test, although inadequacies in ceiling still exist. There have 
been improvements in derived IQ’s due to selection of items and to the extension of 
the chronological ages in the tables. As a research instrument for the study of 
superior adolescents, the new S-B seems to render effective, though not ideal, 
measurement at the top of the scale. 
HANDEDNESS AND DIGIT SYMBOL PERFORMANCE 
RICHARD J. BONIER AND CHARLES HANLEY 


Michigan State University 


PROBLEM 

The format of the Digit Symbol Test, used by Wechsler in his WAIS, WISC, 
and Wechsler-Bellevue measures, may handicap a certain type of left-handed sub- 
ject. Procedures in this speeded test require that subjects, working from left to 
right across the page, enter code symbols below numbers arranged in horizontal 
rows. A model containing the code appears directly above the answer section of the 
page. 

Right-handed individuals typically hold a pencil so that the hand lies below 
and to the side of the implement. This “conventional” posture is also found among 
left-handers. Some left-handed persons, however, grasp the pencil in a “crabbed” 
fashion, positioning it between hand and writer. As the hand and arm move from 
left to right in this style of writing, they mask what has just been written and block 
the view of material above the line. With the Digit Symbol Test, the hand and arm 
in the crabbed style cover both previously encoded material and the model giving 
the correct symbol-number pairs. A conventional writing technique does not impose 
this potential handicap. The present study investigates the effect of these two 
different writing styles upon Digit Symbol performance. 
In his instructions for WISC administration of the Digit Symbol Test (Coding 
B), Wechsler indicates that examiners should “Place a second record form before 
the (left-handed) Subject folded so the relevant code is showing, since a left-handed 
Subject often covers the code when writing on the record form.” This modification 
of directions is absent in the manuals for the WAIS and the Wechsler-Bellevue. In 
the present study, comparisons are made between Digit Symbol performance of 
“crabbed” and “conventional” writers using both standard and modified instructions 
for test administration. 


STANDARD INSTRUCTIONS 


Subjects. Because the “crabbed” technique is relatively rare in women, comparisons 
are restricted to males, students in introductory psychology classes at 
Michigan State University. Of these, 37 were right-handed, 14 conventional left- 
handed, and 15 crabbed left-handed. 


Procedure. Subjects were tested individually in two sessions. In the first, each 
was given a special “unbiased” revision of the Digit Symbol Test employing the 
code used for Form I of the Wechsler-Bellevue. The format of the test was changed 
from horizontal to vertical alignment; that is, subjects worked down rather than 
across the page. On the form for right-handers, the model containing the number- 
symbol code was placed on the left-hand side of the page with the answer section to 
the right, while the numbers to be given symbols appeared in the left of each pair of 
boxes. These arrangements were reversed in the form for left-handers. Both forms 
are “unbiased”, because obstruction from the hand and arm in viewing the model 
and past work is considerably reduced for crabbed left-handers and absent for the 
other subjects. Five to eight weeks subsequent to the initial session, subjects returned 
to take the Digit Symbol Test in its common Wechsler-Bellevue form (Form 
I). Standard instructions were used in both this and the preceding session. 


Results. Means and standard deviations for the three groups on both forms of 
the test appear in Table 1. Analysis of covariance holding constant individual 
differences on the “unbiased” version yields an F-ratio of 8.62 (df 2, 62), significant 
beyond the .005 level, for differences on the standard version of the Digit Symbol 
Test. An analysis of variance of scores on the “unbiased” test gives an F-ratio of 
1.86, not statistically significant. The “within-groups” correlation between scores 
on the two sessions is .86. 
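
The degrees of freedom reported above (2, 62) are what a one-way analysis of covariance with a single covariate gives for three groups totalling 66 subjects. The sketch below shows the textbook computation with the "unbiased" score as covariate and the standard-form score as outcome. The data are randomly generated placeholders, and the function names `ancova_one_covariate` and `fake_group` are assumptions of this illustration, not the authors' own procedure.

```python
import random
from math import sqrt


def _sums(pairs):
    """Return (Sxx, Syy, Sxy) about the means for a list of (x, y) pairs."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    return sxx, syy, sxy


def ancova_one_covariate(groups):
    """One-way ANCOVA with one covariate x and outcome y.

    groups: list of groups, each a list of (x, y) pairs.
    Returns (F ratio, (df between, df within), within-groups r).
    """
    pooled = [pair for group in groups for pair in group]
    k, n = len(groups), len(pooled)
    sxx_t, syy_t, sxy_t = _sums(pooled)
    sxx_w = syy_w = sxy_w = 0.0
    for group in groups:
        sxx, syy, sxy = _sums(group)
        sxx_w, syy_w, sxy_w = sxx_w + sxx, syy_w + syy, sxy_w + sxy
    adj_total = syy_t - sxy_t ** 2 / sxx_t     # y variation left after x
    adj_within = syy_w - sxy_w ** 2 / sxx_w    # same, within groups
    adj_between = adj_total - adj_within
    df = (k - 1, n - k - 1)                    # one df spent on the covariate
    f_ratio = (adj_between / df[0]) / (adj_within / df[1])
    r_within = sxy_w / sqrt(sxx_w * syy_w)
    return f_ratio, df, r_within


rng = random.Random(1)


def fake_group(size, handicap):
    """Placeholder (unbiased, standard) score pairs; not the study's data."""
    pairs = []
    for _ in range(size):
        x = rng.gauss(55, 7)
        pairs.append((x, x + rng.gauss(5, 4) - handicap))
    return pairs


groups = [fake_group(37, 0), fake_group(14, 0), fake_group(15, 8)]
f_ratio, df, r_within = ancova_one_covariate(groups)
print(round(f_ratio, 2), df, round(r_within, 2))
```

With two groups totalling 25 subjects the same function gives df of (1, 22).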


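TABLE 1. DIGIT SYMBOL TEST PERFORMANCE OF RIGHT-HANDED, STANDARD LEFT-HANDED, AND 
CRABBED LEFT-HANDED SUBJECTS UNDER STANDARD AND MODIFIED INSTRUCTIONS 

                     Standard Instructions                       Modified Instructions 
Test Form            RH (N = 37)   SLH (N = 14)   CLH (N = 15)   SLH (N = 12)   CLH (N = 13) 

Unbiased 
  Mean               ...   56...   52.9   51.9   49.8 
  SD                 ...   6.44   6.45   8.11 
Wechsler-Bellevue 
  Mean               ...   52.6   61.0   54.3   60.1 
  SD                 ...   5...   5.79   8.28   7.99 

[Several cells of Table 1 are illegible in the source; the surviving values are given in source order and their column assignment is uncertain.] 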


These findings permit several conclusions. To begin with, the high correlation 
between forms indicates that the “unbiased” version measures much of what is 
tapped by the Wechsler-Bellevue form. Second, left-handers and right-handers are 
not appreciably different in coding ability as such; witness their performances on the 
“unbiased” form. Finally, the format of the Wechsler-Bellevue form handicaps 
crabbed left-handers. (The difference between crabbed and conventional writers on 
the W-B version of the test is large enough to yield significant differences without 
the covariance analysis; analysis of variance of performance on the second session 
gives an F-ratio of 5.69, significant at the .01 level.) 
MODIFIED INSTRUCTIONS 


Subjects. The 25 left-handed subjects, 12 conventional and 13 crabbed, were 
males in introductory psychology classes at MSU in the term following the above 
study. 


Procedure. The procedure was the same as in the preceding investigation, 
except that on the second of the testing sessions the crabbed left-handers received 
the modified instructions quoted earlier from the WISC Manual. 


Results. Means and standard deviations of scores on both forms of the Digit 
Symbol Test appear in Table 1. Analysis of covariance of performance on the W-B 
form, holding constant individual differences appearing on the “unbiased” form, 
gives an F-ratio of 7.64 (df 1, 22), significant beyond the .025 level. The “within-groups” 
correlation between the two forms is .73. 

Table 1 gives very similar means for comparable groups. The difference be- 
tween conventional left-handers in the two studies is statistically insignificant using 
analysis of covariance; the same is true for the scores of crabbed left-handers, a 
further indication of the failure of the modified instructions to remove the handicap. 


DISCUSSION 

Consideration of a subject’s handedness as a factor in intelligence-test evalua- 
tion is apparently a fairly rare phenomenon. The fact that crabbed left-handers are 
significantly handicapped by the format of Wechsler’s version of the Digit Symbol 
Test indicates that they should, when responding to it as part of a full scale IQ test, 
be expected to obtain commensurately lower IQ scores. This assumption is war- 
ranted, of course, only to the degree to which it is safe to regard results when the 
Digit Symbol Test is given alone as comparable to results obtained for it in the 
course of a complete intelligence test examination. 

We cannot explain the results by making the assumption that crabbed left- 
handers are basically less intelligent than others, because this factor has been controlled 
statistically by use of “unbiased” pre-test scores and analysis of covariance. 
Unfortunately, an examiner who wishes to adjust the score of a crabbed left-hander 
to compensate for the bias has no normative material on which to base such a cor- 
rection. 

The study indicates that the modified instructions offered by Wechsler do not 
remove the handicap imposed by the format of the test. Few of the crabbed left- 
handers seemed to pay attention to the altered instructions, and a certain amount of 
confusion was apparent in those who did. Whether these instructions are more successful 
in the complete testing situation is an open question. The “unbiased” form 
of the test employed in the present study appears to overcome the handicap. This 
revision, however, is less practicable than Wechsler’s Digit Symbol Test because 
separate forms are required for right- and left-handers. 

These conclusions are based on results with college students. Perhaps they may 
be extrapolated throughout the entire range of intellectual ability. Two other 
alternatives, however, can be entertained; i.e., crabbed left-handers of lower intelli- 
gence may be handicapped less, or handicapped more, than is the case with the 
college students. Which of the three is correct is a matter for further investigation. 


SUMMARY 

A special version of the Digit Symbol Test (Form I, Wechsler-Bellevue), ar- 
ranged vertically so as to abolish any advantage stemming from right-handedness, 
was given to 37 right-handed and 14 left-handed subjects with “conventional” writing 
technique, and to 15 left-handers who use a “crabbed” technique. Five to eight 
weeks later, the subjects were retested with the Wechsler-Bellevue form. Crabbed 
left-handers were significantly poorer in performance on this second session. 

Groups of 12 conventional and 13 crabbed left-handers were used in a study of 
the effectiveness of modified test instructions suggested by Wechsler. Again, results 
indicate significantly lower performance for the crabbed left-handers on the standard 
form of the test. 

The “within-groups” correlations between scores on the first and second sessions, 
.86 in the initial study and .73 in the second, demonstrate that the special and 
standard versions of the Digit Symbol Test tap similar abilities. 

That differences between crabbed and conventional groups on the standard 
form of the test are due to writing technique and not to coding ability per se is shown 
by the use of analysis of covariance employing scores on the special version of the 
measure as the matching variable. In addition, when subjects are compared in per- 
formance on the special version, no significant differences appear. The format of the 
standard version of the Digit Symbol Test, it appears from these findings, imposes 
a handicap specific to left-handers who write with a crabbed style of penmanship. 
A VALIDITY STUDY OF TAMKIN’S “MMPI SCALE MEASURING 
SEVERITY OF PSYCHOPATHOLOGY” 


DANIEL R. LUTZKER 
Fort Dix, N. J. 


PROBLEM 

In 1959 Tamkin presented some preliminary data on an MMPI scale measuring 
the severity of psychopathology (P) consisting of the F scale and the non-overlapping 
items of Grayson’s Critical Item (CI) scale. For convenience, the non-overlapping 
items of the CI scale are called X in this paper. Tamkin reports that 
a sample of 50 psychotics scored significantly higher than a sample of 25 neurotics 
and personality disorders. No data are given for normal subjects. This paper 
presents data regarding the performance of normal subjects, tests the ability of the 
scale to discriminate between groups differing in degree of psychopathology, and 
provides information concerning the relative usefulness of the P, F and X scales in 
distinguishing between such groups. 


METHOD 

MMPI answer sheets were obtained for five groups of subjects. Basic trainees 
at Fort Dix who were in their first (N = 28), fourth (N = 34) and eighth (N = 31) 
week of training served as the normal control groups. These subjects had been 
tested as part of a research project on the effects of basic training. Because of the 
stress encountered in such training it was expected that measures of psychopath- 
ology would show a steady increase as training progressed. All three groups of 
normal basic trainees, however, were expected to score significantly lower than the 
experimental groups. The experimental subjects were first court-martial offenders 
(N = 45) and 25 consecutive patients who were administered the MMPI for diag- 
nostic purposes after January 1, 1959. These groups were selected so as to include 
only male basic trainees. The group of patients contained both in-patients and out- 
patients and the subjects varied considerably in their symptomatology. 


¹The author gratefully acknowledges the assistance of Stanley G. Fullwood and Christian Kristensen 
in the statistical treatment of the data. 
The MMPI answer sheets were scored for all of the clinical and validity scales, 
the P and X scales, and for mean T-score of the clinical scales. The last of these 
measures was used as a criterion of degree of pathology. 


RESULTS 
The results in Table 1 show that for all three scales there is an increase in mean 
score as psychopathology increases and that a wide gap exists between the three 
normal and the two abnormal groups. This suggests that all three scales measure 
psychopathology. However, the size of the S.D.’s indicates severely limited usefulness 
for individual prediction, as Tamkin found. 


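TABLE 1. MEANS AND S.D.’s ON THE F, X AND P MMPI SCALES FOR FIVE GROUPS 

                        F                X                P 
Group        N      Mean    S.D.     Mean    S.D.     Mean    S.D. 
NBT 1        28     4.07    ...      ...     ...      ...     ... 
NBT 4        34     5.15    ...      ...     ...      ...     ... 
NBT 8        31     7.06    ...      ...     ...      ...     ... 
FCM          45     13.90   ...      ...     ...      ...     ... 
Patients     25     14.80   ...      ...     ...      ...     ... 

[Group sizes are taken from the Method section. The remaining cells of Table 1 are illegible in the source; other surviving fragments are 2.17 (FCM row) and a detached column reading 5, 7, 9, 18.] 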


The mean P score for normal basic trainees in the eighth week of training falls 
about half way between those of the two pathological groups studied by Tamkin. 
This may reflect increased anxiety due to situational stress. An alternative hypoth- 
esis is that some of the subjects may have been feigning pathology in the hope of 
obtaining a discharge from the Army. 

A further check was made on the validity of the F, X and P scales. Each scale 
was correlated with the mean T score (based on the nine clinical scales) for the sub- 
jects in the three normal groups. The resulting correlations were: between mean T 
and F, .84; between mean T and X, .65; and between mean T and P, .87. Using 
mean T as the criterion of psychopathology, all three scales are shown to have some 
validity. However, P does not appear to correlate better with mean T score than 
does F and therefore the value of P is dubious. This conclusion is supported by the 
fact that both the F and P scales discriminate all of the normal groups from the two 
pathological groups at the .01 level but fail to discriminate between the two patholo- 
gical groups. F and P can be used interchangeably for all practical purposes and 
there is little to be gained by scoring answer sheets for P when F is already available. 


SUMMARY 
Validity data, both predictive and correlational, are presented on Tamkin’s 
Psychopathology (P) scale. While the results indicate that the scale is valid, its 
validity is no greater than that of the F scale. Therefore it does not appear worth- 
while to score MMPI answer sheets for this new scale when a conventional scale is 
equally useful. 


AN ABBREVIATED QUALITATIVE SCORE FOR THE PORTEUS MAZES


JOHN R. BARRY
KLEMAN


Psychological Research Center, University of Hawaii


While scoring the qualitative errors on the
Porteus Maze Test, it was noted that two sub-
scores, Crossed (or touched) Lines, and Lift
Pencil, each contributed larger portions of the 
variance to the Qualitative (Q) Score than did the 
other six subscores. The variances based upon 
data from a sample of 70 white, female, hospital- 
ized mental patients were: Lift Pencil 69.56; 
Crossed Lines 34.57; First Third Error 17.81; 
Wavy Lines 16.32; other subscore variances were 
smaller. It is proposed that the sum of the two 
subscores having the largest variances, Lift Pencil 
and Crossed Lines, be used in place of the Q 
Score for ordering patients along the qualitative 
performance dimension. 

In two sub-samples, Pearson r’s between the 
Q Score and the sum of the two subscores were: 
.91(N = 40) and .90(N = 30). The sub-samples 
consisted of white females, aged 20 to 48, tested 
at the time of their admission to a university 
psychiatric hospital. From the size of these r’s, 
it appears that the sum of these two subscores 
can be used to order the patients in approximate-
ly the same way as the Q Score, with a saving in
scoring time and less chance for error. 
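
The comparison described above can be sketched in a few lines of Python. This is only an illustration: the patient records are invented, the function names are hypothetical, and the full Q Score is taken as a given number rather than recomputed from all eight subscores.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def abbreviated_score(lift_pencil, crossed_lines):
    """Sum of the two subscores proposed as a stand-in for the Q Score."""
    return lift_pencil + crossed_lines


# Hypothetical records: (Lift Pencil, Crossed Lines, full Q Score).
patients = [(4, 2, 18), (10, 6, 41), (1, 0, 7), (7, 5, 30), (12, 9, 55)]

short = [abbreviated_score(lp, cl) for lp, cl, _ in patients]
full = [q for _, _, q in patients]
r = pearson_r(short, full)
print(f"r between abbreviated score and Q Score: {r:.2f}")
```

A high r in such a check indicates only that the two scores order subjects similarly; it says nothing about the absolute level of the Q Score.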


This observation was tested using data from 
two other independent samples. One consisted of 
42 normal Chinese boys and 57 normal Chinese 
girls in the ninth grades of three Honolulu public 
schools. The other group consisted of 49 boys and 
40 girls tested at the Hawaii Boys and Girls 
Training School, a state institution for delin- 
quents. Pearson r’s between the Q Score and the 
sum of the Lift Pencil and Crossed Lines sub- 
scores were .86 for the Chinese group, and .93 for 
the delinquent group. 

The consistency of these relationships between 
the Q Score and the subscores, manifest in three 
different samples, supports the generality of the 
finding. Therefore it is proposed that the sum of 
the two subscores, Lift Pencil and Crossed Lines, 
may serve as an adequate substitute for the more 
complex Q Score, when ranking or ordering of the 
subjects is the goal. 


THE APPLICABILITY OF AN AMERICAN DELINQUENCY SCALE TO
JAPANESE SUBJECTS
T. SHIRASA AND T. AZUMA1


Hokkaido University 


PROBLEM 

In 1958, Quay and Peterson reported a
brief scale for the differentiation of delinquent
and nondelinquent males. In several cross-
validational samples of American boys, approx-
imately 67 per cent of the subjects (Ss) were cor-
rectly identified by the measure. The applic-
ability of such scales to other populations, how-
ever, is always open to question, and the present
study was undertaken to determine whether 
similar differentiation could be achieved between 
delinquent and nondelinquent adolescents in 
Japan. 


SUBJECTS AND PROCEDURES 

The scale was translated into Japanese and ad- 
ministered to a total of 357 Ss. Forty of these 
were in a training school for delinquents at 
Engaru in Hokkaido. Another delinquent sample 
was gathered by testing 130 Ss in a corrective 
school at Chitose. There appears, however, to be
some difference between these two samples in
degree of delinquency proneness. The nondelin-
quent sample consisted of 187 Ss chosen from
public secondary and high schools at Sapporo.


RESULTS 

The major findings of the study are presented 
in Table 1. 

A cutting score between 6 and 7, as suggested
in the original report, leads to proper identi-
fication of 88 per cent of the training school Ss at
the expense of 26 per cent incorrectly classified 
Ss among the public school pupils. Boys in the 
corrective school displayed intermediate scores. 
Their mean score was 8.11, as compared with 
11.75 for Ss in the training school, and 4.99 for 
nondelinquents. 
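
The arithmetic behind a cutting score can be illustrated with a short Python sketch. The scores below are invented and the function names are hypothetical. The cutting point is placed at 6.5 to stand for "between 6 and 7", and higher scores are assumed to indicate delinquency, as the group means suggest.

```python
def classify(score, cutoff=6.5):
    """Return True when a score falls above the cutting point."""
    return score > cutoff


def hit_and_false_alarm_rates(delinquent_scores, nondelinquent_scores, cutoff=6.5):
    """Proportion of each group that is called 'delinquent' by the cutoff."""
    hits = sum(classify(s, cutoff) for s in delinquent_scores)
    false_alarms = sum(classify(s, cutoff) for s in nondelinquent_scores)
    return hits / len(delinquent_scores), false_alarms / len(nondelinquent_scores)


# Hypothetical scale scores for two small groups.
training_school = [12, 14, 9, 7, 5, 13, 11, 10]
public_school = [3, 5, 8, 2, 6, 7, 4, 1]

hit_rate, false_alarm_rate = hit_and_false_alarm_rates(training_school, public_school)
print(f"correctly identified: {hit_rate:.0%}, misclassified controls: {false_alarm_rate:.0%}")
```

Moving the cutoff trades one rate against the other, which is why both figures are reported together in the text.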

A split-half reliability of .81 was found for the
training school Ss. Separate reliabilities were
computed for primary and secondary corrective
school boys and odd-even coefficients of .71 and
.75 were found. A Kuder-Richardson coefficient
for public school Ss was .75. The scale was re-
administered after a two-week interval to stu-
dents in the high school, with test-retest correla-
tion of .85.


1We wish to thank the administrators of the
schools for permission to test the subjects, and
Donald R. Peterson of the University of Illinois
for help in preparing the manuscript.
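
For readers unfamiliar with the reliability coefficients mentioned above, the following Python sketch shows one way they might be computed. It rests on assumptions not stated in the report: items are scored 0 or 1, the Kuder-Richardson coefficient is formula 20, and the response matrix is invented.

```python
def kr20(item_matrix):
    """Kuder-Richardson formula 20 for rows of 0/1 item responses."""
    n = len(item_matrix)
    k = len(item_matrix[0])
    totals = [sum(row) for row in item_matrix]
    mean_total = sum(totals) / n
    # Variance of total scores (dividing by n, to match the p*q item variances).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    if var_total == 0:
        raise ValueError("total scores have zero variance")
    pq_sum = 0.0
    for j in range(k):
        p = sum(row[j] for row in item_matrix) / n  # proportion keyed on item j
        pq_sum += p * (1 - p)
    return (k / (k - 1)) * (1 - pq_sum / var_total)


def odd_even_scores(item_matrix):
    """Half-test scores from odd-numbered and even-numbered items."""
    odd = [sum(row[0::2]) for row in item_matrix]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_matrix]  # items 2, 4, 6, ...
    return odd, even


# Hypothetical responses: five subjects by four items.
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

print(f"KR-20: {kr20(responses):.2f}")
print("odd and even half scores:", odd_even_scores(responses))
```

An odd-even coefficient is then the correlation between the two half-score lists; whether the authors applied a length correction to it is not stated.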


DISCUSSION 

The discriminative power of the scale in our 
sample is actually as high as in the groups on 
which it was originally developed. The corre- 
spondence between assumed degrees of socializa- 
tion or delinquency proneness and scores on the 
measure is also surprisingly close, considering all 
the differences between the present Ss, and those 
examined in deriving the scale. But the findings 
go beyond a demonstration of validity; they offer 
a fairly important theoretical implication. 

Recently Gough proposed that certain con-
cepts and dimensions, such as socialization, re-
sponsibility, etc., represent cultural universals,
i.e., they are applicable to all societies at all times
and in all places. As a start in testing this proposi-
tion, the Socialization (So) scale of the California
Psychological Inventory was translated into
Hindi, Punjabi, Spanish, and Italian, and ad-
ministered to various samples in India, Costa
Rica, and Italy. Delinquent and nondelinquent
Ss in all these societies were differentiated about
as sharply as in the United States, and the ap-
plicability of the So scale to rather widely differ-
ent cultural groups was quite impressively estab-
lished.

Quay and Peterson’s delinquency scale cor-
relates about —.70 with the So scale, and can
probably be regarded as another measure of 
socialization. The present results thus offer addi- 
tional confirmation of Gough’s contention re- 
garding the cultural generality of the dimension, 
and vitiate still further the usual pessimism about 
allegedly invalidating “cultural drifts” and
“moving criteria” in empirically developed scales 
such as these. The extent to which a person has 
incorporated cultural norms is probably import- 
ant in any society, no matter what the particular 
content of those norms might be. There is now 
increasingly good reason to suppose that existing 
measures of socialization transcend linguistic, 
racial, and cultural limitations to a degree never 
dreamed of when the scales were first developed. 
All this opens up some rather exciting possibilities 
for cross-cultural research on personality, quite 
beyond any practical descriptive or predictive 
utility the scales might have. 


SUMMARY 

The cultural generality of an American de- 
linquency scale was examined by administering 
it to delinquent and nondelinquent boys in 
Japan. The discriminative efficiency and re- 
liability of the scale were approximately as high 
as in the United States. Since the scale can be 
regarded as a measure of socialization, the de- 
monstrated generality of that dimension has now 
been extended beyond the United States and 
three other societies, to another culture in the 
Orient. 


VISUOCONSTRUCTIVE AND VERBAL RESPONSES IN CHRONIC
BRAIN-DAMAGED PATIENTS AND FAMILIAL RETARDATES*


INTRODUCTION


This study reports a joint investigation by the respective psychology depart- 
ments of a hospital for epileptics and a home and training school for mental retard- 
ates into differential diagnosis of chronic brain syndrome and familial mental retarda- 
tion. In the past, a number of studies involving organic populations have reflected 
the imprecision of the term “brain-damage” by failing to take into account signi-
ficant behavioral differences resulting from localized areas of neurologic abnormality.
Neurologic studies relating functioning behavior to locus of cerebral injury or
abnormality have shown, however, that speech difficulty and visuo-constructive or 
perceptuomotor disabilities seem to stem from certain unilateral lesions or cortical 
malfunctions in the left and right hemispheres. 

Anderson, Stacey and Levine, Kløve, and Reitan established that
an IQ difference between verbal and performance scales has value in differentiating
left-right and bilateral brain damage in that damage to the left hemisphere was
associated with lowered verbal scale scores, damage to the right hemisphere with
lowered performance scores, and diffuse or bilateral damage with slightly depressed
scores on both verbal and performance scales. Balthazar and Morrison reported
that a difference of seven IQ points or greater between Wechsler performance and 
verbal items was of diagnostic value in determining predominant left-right and in- 
determinate unilateralized brain damage. 
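
The pattern just described can be written out as a simple decision rule. The Python sketch below is an illustration only, not a diagnostic procedure: the function name is hypothetical, and the mapping of the seven-point difference onto "left", "right", and "indeterminate" is an assumption pieced together from the two preceding sentences.

```python
def lateralization_hint(verbal_iq, performance_iq, threshold=7):
    """Suggest a predominant side from a verbal-performance IQ discrepancy.

    Lowered performance relative to verbal is read as right-hemisphere
    involvement, lowered verbal relative to performance as left-hemisphere
    involvement, and smaller differences as indeterminate.
    """
    diff = verbal_iq - performance_iq
    if diff >= threshold:
        return "right"
    if -diff >= threshold:
        return "left"
    return "indeterminate"


# Hypothetical Wechsler verbal and performance IQs.
for viq, piq in [(85, 70), (70, 85), (80, 76)]:
    print(viq, piq, lateralization_hint(viq, piq))
```

In the study itself, Wechsler scale discrepancies were used together with EEG findings rather than alone.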

Hunt found that familial retardates were superior to brain-injured defectives
on most visuoconstructive tasks. Gallagher (4) compared matched brain-injured and
familial groups and found those with organic brain syndrome to be inferior to the
familial group in visuoconstructive or perceptuomotor ability, in integrating verbal
concepts, and in making verbal associations. Sievers reported familial cases to
equal or exceed the language level of the brain-injured. No controls were exercised 
in these studies regarding locus of cerebral abnormality and functioning behavior. 

This study tests the hypothesis that minor visuoconstructive abnormalities 
underlie interhemispheric involvement by comparing known brain-damaged cases, 
each having predetermined unilateral abnormalities, with cases classified as so-called 
familial or undifferentiated retardates. We predicted specifically that organics with 
predominant left-right and indeterminately unilateralized hemispheric abnormal- 
ities would tend to be consistently lower on Block Design, Object Assembly, and 
Digit Symbol subtests than those classified as familial retardates. The study will 
also examine verbal ability in the two groups. 


METHOD 


Subjects were 90 chronic organic patients at the Caro State Hospital for Epilep-
tics who were administered Form II of the Wechsler-Bellevue Scales as part of rou-
tine admission procedure in the standard manner. Criteria for the selection of
organic subjects were that subjects have (a) unequivocal evidence in their medical
histories, EEG’s, and general medical examinations of organic brain syndrome; (b)
parents and siblings of estimated average intelligence; (c) no gross neurologic or
sensory abnormalities; and (d) valid Wechslers indicating defective intelligence. An
eight-channel Grassi, Model 3 AR, ink-writing electroencephalograph with ten elec-
trodes was used to make bipolar recordings and to determine, along with Wechsler
scale discrepancies, hemispheric localization of brain damage. Each patient was given
two or more EEG’s.


*The authors are grateful to C. David Hellman for interpreting the electroencephalograms.

Group R, on the basis of the above criteria, consisted of 30 patients diagnosed as 
having predominant right-sided brain damage. Group N included 30 patients with a 
diagnosis of indeterminately unilateralized or bilateral brain damage. Group L was 
composed of 30 patients classified as having predominant left-sided brain damage. 
The term ‘‘predominant”’ served to qualify classifications of right-sided and left- 
sided lateralization because no rigid controls existed for minimal diffuse or bilateral 
damage which could have caused mild disturbance in the contralateral hemisphere. 

An additional group of 30 subjects, Group F, from the Lapeer State Home and 
Training School was also administered Form II of the Wechsler Scales. The criteria 
for the selection of Group F were: (a) all were mentally retarded on the basis of the 
Wechsler; (b) there was unequivocal evidence of mental retardation among their 
parents and siblings; (c) there was no evidence of trauma contributing to mental 
retardation; and (d) there was no evidence of chronic or acute brain syndrome in the 
medical histories or examinations. Group F, therefore, consisted of individuals classi- 
fied in a state home and training school as familial or undifferentiated retardates. 
All organic groups, Groups R, N, and L, as well as the non-organic Group F, were 
equated on the basis of full scale IQ, age, and sex. Subtest means and standard de-
viations were calculated for each of the groups and the subtest mean differences were
compared and tested for significance by the t test.
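
A comparison of two subtest means of the kind described can be sketched as follows. The report does not say which form of the t test was used, so the pooled-variance test for independent groups shown here is an assumption, and the weighted scores are invented.

```python
from math import sqrt


def pooled_t(sample_a, sample_b):
    """Independent-samples t statistic with pooled variance, plus its df."""
    na, nb = len(sample_a), len(sample_b)
    mean_a = sum(sample_a) / na
    mean_b = sum(sample_b) / nb
    ss_a = sum((x - mean_a) ** 2 for x in sample_a)
    ss_b = sum((x - mean_b) ** 2 for x in sample_b)
    df = na + nb - 2
    pooled_var = (ss_a + ss_b) / df
    standard_error = sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean_a - mean_b) / standard_error, df


# Hypothetical weighted scores on one subtest for two groups.
group_f = [7, 8, 9, 6, 10]
group_r = [4, 5, 6, 3, 7]

t_value, df = pooled_t(group_f, group_r)
print(f"t = {t_value:.2f} with {df} degrees of freedom")
```

The obtained t is then compared with the tabled critical value for the given degrees of freedom.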


RESULTS 


There were significant weighted score differences between Group R (predomi-
nantly right-sided brain-damaged) and Group F (familial) on the Information, Arith-
metic, and Similarities subtests of the verbal scale, and Picture Arrangement, Picture
Completion, Block Design, Object Assembly, and Digit Symbol subtests of the per- 
formance scale. Group R was clearly superior to Group F on verbal test items but not 
on the performance scale. Group F achieved significantly higher weighted scores on 
all performance test items including visuoconstructive tasks. 


TABLE 1. SCORES, MEAN DIFFERENCES AND LEVELS OF SIGNIFICANCE BETWEEN W-B SUBTESTS FOR
FAMILIAL (F), RIGHT HEMISPHERE ORGANICS (R), INDETERMINATE ORGANICS (N), AND LEFT
HEMISPHERE ORGANICS (L)


When Group N, those with bilateral or diffuse brain damage, was compared to
Group F, verbal superiority, favoring the organic group, was reduced to the Informa-
tion and Arithmetic subtests. Group F achieved significantly higher scores on the Object
Assembly and Digit Symbol subtests.

There were no statistically significant differences in verbal scale subtests when 
Group L, the predominantly left-sided organics, was compared with Group F. The 
previously noted trend in favor of the organics did not occur in the left-sided group. 
Group F maintained superiority in Object Assembly and Digit Symbol subtests and 
both groups attained their highest scores in these two subtests. 


DISCUSSION 


The right-sided group was unique in showing significantly lower scores on Block 
Design. Reitan in an unpublished monograph reported individual examinations
indicating impairment on Block Design to be associated with right hemispheric in- 
volvement. Present findings, however, regarding brain-damaged groups, R, N, L and 
the familial group F, with the notable exception of the Block Design subtest, tend to 
support the hypothesis that basic differences between these organic groups and 
familial retardates seem to lie in visuoconstructive responses to a degree independent 
of hemispheric unilateralization. In this, our findings are in gross agreement with 
previous psychological studies regarding perceptuomotor differences. They are not, 
however, wholly consistent with results obtained in the investigations of Gallagher 
(4) and Sievers regarding verbal factors in differentiating organic from so-called
undifferentiated groups. 

Using institutional samples, the right-sided group showed superiority over fam- 
ilials in verbal scale items. This was less true of indeterminately unilateralized cases 
and not true of predominant left-sided cases. Wechsler verbal scale test items are 
perhaps more general in nature than the more specialized language tests of Gallagher 
(4) and especially Sievers. It seems difficult, however, despite the possible limita-
tions of omnibus testing, to believe that an organic group whose cortical abnormali-
ties are, for the most part, confined to the minor hemisphere should be verbally in- 
ferior to a group whose language difficulties seemed sufficient for institutional com- 
mitment. 

While the findings suggest behavioral differentiation of measured abilities in the 
two hemispheres, inquiry is necessary to clarify further probability of more precise 
localization. This is a question we cannot answer with our present data. The ques- 
tion of localization is a major one, since the inaccuracy of free hand placement of 
electrodes, the superficiality of cortical contact limiting hemispheric localization, and 
sampling error play an essential part. Reitan, in a rigidly controlled study, was
forced to a somewhat similar conclusion regarding the occurrence of vascular changes 
and edema in the hemisphere. 

Hécaen, Penfield, Bertrand, and Malmo in a ten-year study using precise neuro-
logic controls concluded that visuoconstructive disabilities are integral parts of both
hemispheres. They reported also that disturbances of spatial orientation, which we 
believe to underlie visuoconstructive disability, occurred especially from right parieto- 
temporal lesions. This would seem consistent with our results that visuoconstructive 
difficulty occurs with left hemispheric involvement, or with malfunction in both 
hemispheres, but is more especially marked in the right hemisphere. 

Present findings imply that visuoconstructive responses may have some diag- 
nostic value in discriminating chronic cases of organic brain syndrome from the un- 
differentiated, subcultural familial type in terms of measured abilities. Such re- 
sponses may also underlie prior considerations of hemispheric localization to the 
extent that they may be basic to both hemispheres and with Wechsler scale dis- 
crepancies may detect unilateral lesions or malfunctions independently of sup- 
plementary neurological information. To investigate this possibility, however, addi- 
tional study must undertake to determine whether psychometric study can accurate- 
ly detect behavioral manifestations of inter-hemispheric symptoms and whether de- 
tection of these symptoms validly discriminates additional diagnostic groups. 


SUMMARY


The use of Form II of the Wechsler-Bellevue Scales to discriminate 90 organic 
brain-damaged patients and 30 institutionalized familial retardates was investigated. 
Results showed that those with predominant right-sided brain damage were superior 
to familials on verbal items and inferior on all performance tasks, including visuo- 
constructive test items. Closer score approximation was achieved when diffuse brain- 
damaged cases were compared to familials, but brain-damaged cases maintained some 
verbal superiority as well as performance inferiority on visuoconstructive tasks. The 
exception was Block Design. When predominant left-sided organics were the com- 
parison group, however, verbal superiority was lost. With the exception of Block De- 
sign, the familial group maintained superiority in visuoconstructive responses. Im- 
plications of these findings were discussed in terms of the basic hypothesis, and con- 
sideration given to further study to determine if visuoconstructive responses, com- 
bined with Wechsler scale discrepancies, are essential in detecting hemispheric uni- 
lateralization independently of neurologic criteria. 


STANDARDIZATION OF THE GRAPHOSCOPIC SCALE:
THE CONTENT OF CHILDREN’S DRAWING*
JUSTIN PIKUNAS AND HUGH CARBERRY1

University of Detroit 


PROBLEM 

This study investigated some relations between developmental level and content 
of children’s drawings on the basis of partially structured perceptual cues. The 
majority of studies in the literature dealing with the content of children’s drawings 
have been longitudinal studies. Eng’s detailed study of the drawings of her niece,
Hildreth’s investigation of more than four thousand drawings made by a boy be-
tween the ages of two and eleven years old, and McCurdy’s study of more than
one thousand drawings made by a boy between the ages of two and one-half and
seven years, are major sources. The leading cross-sectional study conducted on the
content of children’s drawing was McCarty’s analysis of 31,239 drawings of
children between the ages 4 and 8 years. 


METHOD 

The instrument used in collecting the data for this study was the Pikunas
Graphoscopic Scale (PGS), which is a semistructured projective drawing test
primarily designed for children. The PGS contains ten drawing plates, eight of
which have semistructured perceptual cues, while one is fairly structured but is
compensated by another which is free drawing of the “object you most often draw,”
and the eleventh is a written answer to the question, “What else would you like to
draw?” The fact that the PGS is a semistructured test adds to the interest of this
study of the content of children’s drawings for, up until this time, the majority of
studies has been centered around either free or particular drawings.

Our standardization sample consisted of 350 children from the Detroit area 
schools, who ranged in age from five to fifteen years and who provided approximately 
3,200 individual drawings. 

The focus of this statistical content analysis will be on the following three areas: 
(a) a comparison of the four most frequently drawn content categories at the various 
ages, (b) a comparison of the content drawn at three major developmental levels; 
namely, middle childhood, late childhood, and early adolescence, and (c) an analysis 
of the specific tendencies depicted for most perceptual cues of the PGS. 


RESULTS 

Table 1 compares the four most frequently drawn content categories at the 
various ages and reveals that the overall chi square value of 151.87 is significant at 
the .01 level. The chi square values obtained for each of the four major content 
categories were also significant at the .01 level. Hence we can conclude that the 
various ages did differ significantly in the frequency of the content drawn. In order 
to make the frequencies more meaningful they have been reported in terms of percent 
drawn at each age. 
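
The chi square computation referred to above can be sketched for a small contingency table. The counts below are invented and reduced to two age groups and two content categories for brevity; the function name is hypothetical, and the sketch does not reproduce the values in Table 1.

```python
def chi_square(table):
    """Chi square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if content were independent of age.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Hypothetical drawing counts: rows are age groups, columns are
# "human" and "animal" content.
counts = [
    [30, 10],
    [10, 30],
]

statistic, df = chi_square(counts)
print(f"chi square = {statistic:.2f}, df = {df}")
```

The test is run on the raw frequencies; conversion to percentages, as in the tables, is for presentation only.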

Table 1 also shows that the most frequently drawn objects for the group as a 
whole were those involving the human person. In the “human total” category
(which is a summary percentage including total human persons, human heads, halves 
of human bodies and human details), a definite trend was noted in that there was a 
gradually decreasing percentage of human content as the chronological age increased. 
This finding at the earlier age levels (five through eight years) is in agreement with 
McCarty who interpreted this trend as an increasing interest in other areas. The
human form drawn at all age levels was usually an adult figure. An increase is again
noted at the eleven through thirteen ages. However, this does not exceed the per-
centage at the lowest age level and may be interpreted as one of the consequences of
the increased social awakening typical of these years.


*This study was in part presented at the 1960 convention of the American Psychological Associa-
tion, Chicago, Illinois.

1Now at Villanova University.


TABLE 1. PERCENT OF THE FOUR MOST FREQUENTLY DRAWN CONTENT
CATEGORIES AT THE VARIOUS AGE LEVELS


Factory made
N Human Objects Animals Houses
31 51.0% 26.7%
25 41.8 40
31 35.5 40
‘ 39 36 .¢ 34.7
9 34 7 30
10 33 ‘ 45.3
11 33 35 34
12 34 4 32
13 30 3 31.
14 46 y 48
15 13 0.2 31
Drawings N 3074 1140 1131
χ² 63.18* 155.65* 60.24*


*Significant at the .01 level.
Overall chi square for Table 1 is 151.87, significant at the .01 level.

The second most frequently drawn type of objects for the group as a whole was
“factory-made objects”. In this category a fairly steady increase with age occurs,
with the lowest percent being drawn by the five year age group. The peak percentage
for this category was reached at the ten and fourteen year age groups. The “factory-
made objects” category, due to its inclusive definition, also yielded more specific
trends within the different age levels, ranging from simple household objects and 
depictions of toys at the earlier age levels, to the depiction of moving objects, like 
trains, cars and boats at the older age levels. The drawing of dresses and hats by the 
girls at the older age level was also revealed, pointing to the possibility of sex role 
awareness and acceptance. 

The third most frequently drawn object was found in the animal category. Gen-
erally, the percentage of animal content drawn fluctuates from year to year. The
peak of animal drawing was reached at the thirteen year level, while the lowest per-
centages of animal content were drawn at the five, six, and seven year level. The lack
of emphasis on animals at the younger ages is in direct contrast to the young child’s 
tendency to identify with animals. This is probably due to the emphasis placed on 
human figure drawing in the school and home. While animal content was found to be 
consistently lower than human content at all ages, the manner of depicting animals 
at the various ages differed considerably. At the earlier ages there is very little dis- 
tinction between the animal drawings and the drawings of the human figure; the 
distinction is largely made by the child’s labeling of the content. As the age level 
increased, a change in orientation appeared, with the animal’s body usually placed in 
a horizontal position, and with detail appearing in terms of appendages, such as tails, 
whiskers and fur. The rather large increase in percent at thirteen years may be re- 
lated to an expression of the increasing instinctual urges which are concomitant with 
puberty, and also as a rise of interest in pets. 

Houses were the fourth most frequently drawn objects, but the only discernible 
trend revealed was the rather steady decrease in percent in the older age groups 
(eleven through fifteen). This decrease may be the consequence of the child’s in- 
creasing maturity and his attempt to depict more complex objects. A general im- 
provement in the symmetry, neatness, and number of details in the houses was also 
a product of increasing age. At the upper age levels, the drawing of a house was 
usually accompanied by surrounding details such as flowers, trees and grass, which 
made the portrayal more of a scene than the isolated drawings of a house that ap-
peared at the earlier age levels. There was greater tendency to draw buildings other
than houses as the age level increased. Furthermore, content dealing with nature,
landscapes, designs, abstract material and plant life were among the more frequent.
Drawings of food, usually of the sweet variety, emerge at seven and are fairly dom-
inant up to eleven. The striving for completion and composition was evidenced in
all the objects drawn and was a definite accompaniment of increasing mental growth.


Developmental levels. In an attempt to obtain a clearer picture of the growth 
trends, an analysis of the content was made of the four major categories at three 
basic developmental levels: middle childhood (7 and 8 year group), late childhood 
(10 and 11 year group), and early adolescence (13 and 14 year group). The results 
given in Table 2 show that the overall chi square (48.53) is highly significant at the 
.01 level. From this it may be concluded that the groups differed significantly in 
terms of the content drawn, although the chi square values obtained for “human”
and “factory-made objects” did not reach the required level of significance. These
categories are almost equally popular at all three developmental levels.


TABLE 2. PERCENT OF THE FOUR MOST FREQUENTLY DRAWN CONTENT CATEGORIES AT THREE
MAJOR DEVELOPMENTAL LEVELS


                        Middle      Late        Early         Total
Content                 Childhood   Childhood   Adolescence   Frequency

Human                   35.8%       30.0%       31.2%
Factory-made objects    37.6        40.1        41.0
Animals                 10.6        13.4        20.1
Houses                  15.8        16.2
Total Drawn             622         632         646


The overall chi square for Table 2 is 48.53, significant at the .01 level.
*Significant at the .01 level.


In the “animal” category, which shows significant differences, there is a steady
increase in percent drawn from level to level. Again, there is evidence for the ex-
pansion of interest due to life experiences. It has been suggested that the sharp rise
in percent at early adolescence may be interpreted as an expression of the vitalized
instinctual drives typical of the early adolescent level of development.

In the “house” category, there is a slight increase in percent in late childhood
followed by a decrease in early adolescence. In early adolescence the drawing of
houses is apparently replaced in part by more complex architectural objects. Eman-
cipation from the home occurring at this age is possibly a factor of high significance.

Item Findings. In order to see the usual and popular, an analysis of the content 
drawn for each PGS item (plate) was also made. The percentages mentioned in this 
analysis are based on the total number of drawings at the various ages. The major 
trends for each PGS plate number are noted in the subsequent summary, except the 
first which is too structured to have relevance for any content analysis. 

The most frequently drawn objects for Item 2 were animal figures; while the 
second and third most frequently drawn objects were the human figure and factory- 
made objects, respectively. At the lower age levels (5, 6 and 7 years) the percentage 
of animals drawn (25.8, 38.3, 19.4) is lower than the human content (34.4, 34.4, 41.5), 
but then it gradually rises, reaching a peak at 13 years (70.8). The highest percent-
age of human content is reached at seven years (41.5), and then there is a gradual
decline reaching the lowest frequency at 11 years (18.6) and then rising again at 12 
years (34.2). In the frequencies of factory-made objects no trend is indicated. 

The most frequently drawn object for Item 3 was the human category while the 
second and third most frequently drawn objects were animals and factory-made 
objects, respectively. The trend revealed in human category is essentially the same 
as in Item 2 where the highest percentage was in the early years (63.3, 70.3, 54.0)
with a gradual decline followed by a rise at the 12 year level (55.8). The lowest per- 
centage of animal content is found at years 5 (3.3) and 6 (3.7) followed by an inter- 
mittent rise and fall in percentages at the other age levels. The highest percentages 
of animals drawn are found at years 8 (34.1) and 13 (30.0). The highest percentage of 
factory-made objects was at the 13 year level (20.0). No trend is indicated for this
category of content.

The most frequently drawn object for Item 4 was human, followed by factory- 
made objects and animals. The highest percentage of human content was at the 8 
year level (67.5) and the lowest at the 5 year level (42.7). The highest percentages of 
factory objects were found at the 5 (35.7) and 10 year level (34.4), while the highest 
percentage of animals drawn is found at the 13 year level (30.0). For either category
of content no age trend is indicated.

The highest frequency for Item 5 was factory-made objects followed by human 
drawings and food. None of the content categories reveal any age trend. The highest
percentages of factory-made objects were found at the 12 (57.1) and 14 year
levels (65.9). The lowest percentages of human content drawn were at the 12 (35.6) and
14 year levels (24.9), while the highest percentages for food category were found at
the 13 (17.2) and 11 year levels (13.7). 

The most frequently drawn objects for Item 6 were factory-made objects follow- 
ed by human content and plants, respectively, with no trend indicated on either con- 
tent. The highest percentage of factory-made objects was found at 12 years (67.8), 
and the lowest at 5 (37.9), while the highest percentage for plant category was found 
at the 5 year level (20.6). 

The most frequently drawn objects for Item 7 were factory-made objects follow- 
ed by aircraft and nature. A gradual increase in factory percentages is noticed reach- 
ing a peak at the 10 year level (67.6) and dropping only slightly thereafter. The low- 
est percentage in this category is found at the 5 year level (23.3). The highest per- 
centage was found at the 6 year level (32.1) for aircraft category while the lowest was 
at the 12 year level (30.0). The highest percentage for nature category was found at 
the 8 year level (31.4). In the last two content categories no age trends are shown. 

The most frequently drawn objects for Item 8 were factory-made objects, closely 
followed by houses and then human content. The highest percentages for factory 
objects are found at the 8, 9 and 10 year old levels (39.4, 41.1, 45.4). The highest 
percentages were found at the 11 and 12 year levels (58.8, 58.8) for houses while the 
lowest percentage was found at the 5 year level (7.1). The highest percentage of 
human content was at the 6 year level (20.4). No trend is discernible in either con- 
tent category. 

The most frequently drawn object for Item 9 was factory-made objects followed 
by human content and houses, respectively. In factory-made objects no trend is 
indicated. The highest percentages were found at the 12 (44.7) and the 14 year level 
(47.7). Without a trend, the highest percentages of human content were found at 
the 8 and 11 year level (46.9, 45.8). In house drawing a gradual rise is seen reaching a 
peak at 10 years (20.5) and then declining. 

The most frequently drawn object for Item 10 was factory-made objects follow- 
ed closely by human content and houses. In factory objects and houses no trend is 
indicated. The highest percentage for the first category was found at the 10 year 
level (38.2) while for the houses it was found at the 5 year level (50.0). In human 
content there is a gradual decrease in percentages up to the 11 year where a slight 
increase is noted. The highest percentage of this category was the 5 year level (32.3) 
and the lowest at the 9 (8.5) and 10 year levels (8.7). 

The most frequently mentioned objects for Item 11 were human followed by 
houses and factory-made objects. From the 6 year level on, there is a gradual de- 
crease in human percentages to the age of 10 followed by a gradual increase in the 
later years (11, 12 and 13). The highest percentage of human content is found at the 
13 year level (35.7). The high for houses category is found at the 5 year level (67.0) 
and again at the 8 year level (50.0). The highest percentages for factory objects 
were found at the 7 (32.2) and 10 year levels (32.3) while the lowest percentages were 
found at the 5 year level (5.8). 


SUMMARY AND CONCLUSIONS 


A total of 350 Detroit area children between the ages of five and fifteen provided 
approximately 3,200 drawings on the PGS plates. Tabulations and analyses of the 
drawing content result in the following major findings: 

1. The group as a whole tended most frequently to draw human individuals, 
factory-made objects, animals and houses, respectively. Significant differences for 
each of the four major categories were found at all ages. Using the chi square tech- 
nique, differences were found to be significant at the .01 level. In the “human” con- 
tent category there was a general decrease in the percent of human content drawn 
as the chronological age increased. The increment in the drawing of “human” con- 
tent at the later ages (12 and 15) was interpreted as one consequence of increased 
social awareness. In the “factory-made objects” category an increasing percent was 
noted as the age increased, with a peak being reached at the ten and fourteen year 
age groups. In the “animal” category no discernible trend was noted but a relatively 
high peak was reached at thirteen years. This high peak was interpreted as an ex- 
pression of the early adolescents’ magnifying instinctual urges. A decreasing percent 
of “houses” drawn in the eleven through fifteen age brackets was the only discern- 
ible trend noted for the “house” category. 

2. The stimuli of the PGS were found to be rather well adapted for eliciting a 
great variety of content at all ages. The most recurring categories other than the 
above mentioned were as follows: nature, landscapes, designs, abstract material, 
plant life and food. 


3. A comparison of the content drawn at middle childhood, late childhood 
and early adolescence revealed significant frequency differences in the “animal” 
and “factory-made objects” categories. In the “animal” category a gradually in- 
creasing percent of drawings was noted from earlier to later developmental levels. 


4. In an analysis of each particular item of the PGS, specific trends common to 
that item were also identified. 


5. A striving for completeness, composition, improvement in orientation to- 
ward test cues, and enlarged content and its detail production were noted as the 
chronological age increased. 


STRUCTURED PERSONALITY TESTING IN THE AGED: AN MMPI 
STUDY OF THE GERONTIC POPULATION¹ 
WENDELL M. SWENSON 


Mayo Clinic, Rochester, Minnesota 


PROBLEM 


Careful perusal of the literature reveals no personality studies among the aged 
in which the Minnesota Multiphasic Personality Inventory (MMPI) was used as a 
primary psychometric device. Calden and Hokanson included 15 male tuber- 
culous patients aged 60 to 69 in a study measuring the influence of age on MMPI 
results. The purpose of the present study is twofold: to indicate how well elderly 
individuals can be tested with presently available structured personality tests, and 
to describe a sample of the nonhospitalized aged population in terms of the Min- 
nesota Multiphasic Personality Inventory. 


PROCEDURE 


The MMPI data described here were obtained as a part of a larger psycho- 
metric study of a gerontic population. Two hundred ten individuals aged 60 years 
and more cooperated in completing a brief attitude questionnaire. These subjects 
were asked also to complete the group form of the MMPI, but very little individual 
urging was done. Of the 210, 95 (45.1 per cent) completed the MMPI and ob- 
tained valid profiles, a valid profile being defined as one in which the “cannot-say” score did not 
exceed 110 and the raw F score was not greater than 21. Of the 115 subjects in the study not 
completing the MMPI, 70 refused to take the test, 39 began the test but stopped 
before completing it, and 6 completed the test but with “cannot-say” scores above 
the upper limit for validity. As a group, subjects who cooperated in taking the 
MMPI were younger than those who did not (median ages 71.4 and 74.8 years); and 
they were more often gainfully employed rather than confined to a rest home. 
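
The validity rule and the attrition counts given above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function and variable names are hypothetical, and only the cut-offs (110 and 21) and the subject counts come from the text.

```python
# Cut-offs stated in the text for a valid profile.
CANNOT_SAY_MAX = 110
RAW_F_MAX = 21


def is_valid_profile(cannot_say, raw_f):
    """Return True when a profile meets both validity limits (hypothetical helper)."""
    return cannot_say <= CANNOT_SAY_MAX and raw_f <= RAW_F_MAX


# Attrition reported in the text for the 210 subjects approached.
approached = 210
refused = 70
stopped_early = 39
invalid_cannot_say = 6

not_completed = refused + stopped_early + invalid_cannot_say
valid_profiles = approached - not_completed

print(not_completed, valid_profiles)
```

The three reasons for non-completion account for all 115 subjects without a usable MMPI, leaving the 95 valid profiles analyzed in the study.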

The 95 tested subjects came from three separate sources: (1) so-called “Golden 
Age Clubs,” (2) homes for the aged, and (3) their several places of employment. 
The men numbered 31 and the women 64. No attempt was made to structure the 
sample so as to include or eliminate any particular type of classification of the aged, 
except that those with known mental disorders intense enough to warrant a psy- 
chiatric diagnosis were eliminated. All the subjects in the “Homes for the Aged” 
group were confined in the home primarily for physical incapacity. 

The examiner met with the subjects individually or in small groups at their 
clubs or resident homes or elsewhere, and described the project as a “study of inter- 
ests and attitudes of the aged.” The group form of the MMPI was used exclusively 
in the testing. 

RESULTS 

The coding system used in this investigation is that described by Hathaway and 
Meehl. Each of the 10 scales is represented at least once as high-point in the 
codes of both sexes, with the exception of scale 7, which has no high point in the male 
sample. Scales 0², 1, 2, and 3 are the most frequently occurring high point scales in 
both sexes. The 0 scale appears more frequently by proportion in female subjects 
than in the male group. Combinations of scales 4 and 9 in either sex as high points 
are conspicuous by their absence. These two scales (4 and 9) appear with relatively 
high frequency among the sample of 258 male and 368 female adult normals in 
Minnesota. Comparison of this group with the normative group suggests greater 
statistical abnormality in the gerontic group (more codes with scales > 70). 

In the male gerontic sample, eight (25.8 per cent) of the 31 profiles have at least 
one score equal to or greater than T-score 70, and therefore are identified as “prime” 


¹Read at the meeting of the American Psychological Association, Cincinnati, Ohio, September 
1959. 


²Scale 0 refers to the Social Introversion Scale. 
codes. In 6 (75 per cent) of these “prime” codes, the 2 scale is one of the scores equal 
to or greater than 70. In the female gerontic group are 20 prime codes which con- 
stitute 31.9 per cent of the entire female sample. These prime codes appear in all 10 
of the clinical scales except the 4 scale, and show no particular point of concentration. 
The relative frequencies of the various high-point codes in the male and female 
samples are presented in Table 1. Four frequently occurring one-point codes are 
identical in the male and female groups, but are reversed in rank (male: 3, 2, 1, 0; 
female: 0, 1, 2, 3). No two-point codes appear with sufficient frequency to be identi- 
fied as typical of either the male or female group. The 9 scale is the most frequently 
occurring low point, and is found in 10 (32.1 per cent) of the male subjects and in 23 
(35.9 per cent) of the female group. The next most frequent low points in the pro- 
files are the 6 scale and the 4 scale. 

TABLE 1. FREQUENTLY OCCURRING ONE-POINT MMPI CODES FOR 31 MALE 
AND 64 FEMALE GERONTIC NORMALS 

[Table body illegible in the source. Columns: High-Point Code for Males (Scale, Cases) and for Females (Scale, Cases). Surviving case counts, not assignable to scales: 17, 11, 9, 6, 6.] 
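
The notions of high point, low point, and “prime” code used in this section can be illustrated with a short Python sketch. It is a deliberate simplification of the coding system cited in the text, not a reproduction of it: the function name and the example T-scores are hypothetical, and ties are resolved arbitrarily in favor of the first scale listed.

```python
def summarize_profile(t_scores):
    """Return (high-point scale, low-point scale, is_prime) for one profile.

    t_scores maps a scale label ("1" .. "9", "0") to its T-score.
    A profile is treated as "prime" when any T-score is 70 or greater,
    following the definition given in the text.
    """
    high = max(t_scores, key=t_scores.get)
    low = min(t_scores, key=t_scores.get)
    prime = any(t >= 70 for t in t_scores.values())
    return high, low, prime


# Hypothetical profile, not taken from the study's data.
example = {"1": 62, "2": 71, "3": 65, "4": 48, "5": 52,
           "6": 50, "7": 55, "8": 54, "9": 41, "0": 60}

print(summarize_profile(example))
```

In this invented profile scale 2 is the high point, scale 9 the low point, and the profile counts as prime because scale 2 reaches a T-score of 70 or more.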

Hathaway and Monachesi, in their studies of 4000 midwestern ninth graders, 
concluded that the median adolescent profiles tended to be in the direction of “be- 
havior disorders” or of psychotic trends rather than toward neurotic patterns. 
Definitely, the reverse was true in this gerontic group. The median profiles here 
were commonly of a neurotic pattern, and only rarely gave evidence of psychosis or 
behavior disorders. The most typical combination of coded high and low points for 
both sexes in the adolescent group (Hathaway and Monachesi) was 9-2. In the 
gerontic group that combination is 3-9 for the males and 0-9 for the females. 

It is of interest to compare this sample with one other group on which there are 
MMPI data available. Schiele and Brozek, analyzing the MMPI profiles of 32 
adult males subsequent to a 6-month period of semistarvation, found a “mean pro- 
file” strikingly similar to the median profile obtained in the present investigation. 
The high points of the semistarvation group were scales 2, 3, and 1, and the low 
point of the typical code was scale 9. The subjects in the semistarvation group had 
undergone physiologic stress which, among other things, probably had threatened 
their security. Granick, in describing the psychopathology of aging, stated that 
“the outstanding factor in personality breakdown in old age seems to be insecurity.” 
The possible psychologic similarity between these two groups at present can only 
be conjectured. 

In 1956, 100 female geriatric patients in a state mental hospital were evaluated 
by the writer through the use of structured psychometric devices. Sixty-six of these 
patients cooperated in taking the MMPI and produced valid profiles. Table 2 shows 
the means and standard deviations of this group on each of the MMPI scales in 
comparison with means and standard deviations of the 95 subjects of the present 
study. Because of apprehension concerning the use and comparison of “composite” 
MMPI profiles, no statistical comparison of these two groups is made here. Certain 
trends, however, are evident: (1) The state hospital group appears to be more var- 
iable in its personality characteristics than the so-called normal gerontic group. (2) 
The T-scores of the clinical scales in the MMPI tend to be higher among the state 
hospital geriatric group. (3) The elevations tend to be higher in the psychotic end 
of the profile (scales 6 and 8, particularly) in the state hospital group. 
TABLE 2. TABLE OF MEANS AND SIGMAS OF MMPI T-SCORES OBTAINED FROM 
66 FEMALE STATE HOSPITAL PATIENTS AND 95 NONHOSPITALIZED AGED 

[Table body largely illegible in the source. Columns: Means (geriatric patients, gerontic normals) and Standard Deviations (geriatric patients, gerontic normals). Legible row labels: 2 (D), 3 (Hy), 4 (Pd), 9 (Ma), 10 (Si). Legible first-column entries: 3 (Hy), 61.26; 4 (Pd), 62.71.] 

SUMMARY 

Without much individual pressure 95 of 210 subjects more than 60 years old 
were persuaded to take the MMPI group test. The typical or median profile for this 
group is a neurotic one with an absence of evidence of psychotic or behavior disorder 
tendencies—a finding in reverse of that demonstrated in a large group of adolescents, 
but strikingly similar to a small group of males subjected to a period of semistarva- 
tion. 

A distinct sex difference in the group occurs in the frequency of the appearance 
of the 0 scale as the high point in the profile, its presence being much more common 
among females. This finding is similar to that from the ninth-grade students studied 
by Hathaway and Monachesi. 

Scales 4 (Pd) and 9 (Ma) or any combination of them occur rarely as high- 
point. These two scales stand out clearly as the most common low-point codes of 
both sexes. Both sexes have fewer codes with no high point than are found in samples 
of younger normal males and females. 

Approximately 25 per cent of the males and 30 per cent of the females have 
prime coded profiles indicating T-scores of 70 or greater on at least one scale. The 
2 (D) scale tends to appear more frequently as a codable high point among males 
than among females. 


RORSCHACH RESPONSES OF CHILDREN EXHIBITING PSYCHOGENIC 
AUDITORY SYMPTOMS 


FRANK KODMAN JR. AND JERRY E. WATERS 


University of Kentucky 


PROBLEM 

An interesting problem observed by the clinician is the youngster, preschool or 
school age, who reacts to audible sound as though he were hard of hearing when in 
fact his hearing is normal or near normal. Although this youngster will frequently 
show a discrepancy of 15 db or more between various threshold measurements, it is 
not sufficient simply to label him as a functional hearing loss case. We would argue 
on clinical grounds that functional hearing loss, when properly evaluated, is a man- 
ifestation of psychopathology. 

Juers; Kodman and Pattie; Kodman et al.; Kodman and Blanton; and 
Myklebust have called attention to psychogenic deafness in children where the 
cardinal auditory symptom was usually a subnormal response to conversation with a 
discrepancy between the pure tone thresholds and the speech thresholds. Psycho- 
genic deafness in children as a manifestation of psychopathology warrants further 
inquiry into the personality dynamics involved in addition to the measurement of the 
degree of functional hearing loss by auditory tests. This study investigates some of 
the personality characteristics of early psychogenic deafness by means of Rorschach 
analysis. 


PROCEDURE 

Eight children, four males and four females, were diagnosed as psychogenic 
largely on the basis of their general behavior and a 30 db or greater loss between 
conventional pure tone audiometry and conventional speech audiometry. They were 
matched for age and sex with eight hard-of-hearing children who showed an average 
30 db binaural loss with a range from 20-50 db. The psychogenic group showed 
essentially normal hearing (loss no greater than 5 db) by speech audiometry. The 
average functional loss for the Psychogenic group was 30 db with a range from 
20-55 db. 

The children were screened for mental retardation and aphasia by means of an 
extensive case history, parent interviews, educational and audiological evaluation, 
and psychological tests including the Rorschach, Columbia Mental Maturity Scale, 
Bender Gestalt and Draw-A-Man tests. The mean age of each group was 11 years; 
range 8-14 years. Each group fell within the normal IQ range with a mean of 94.0, 
SD ± 6. The children were selected carefully from our diagnostic caseload over a 
three year period. Only the more meaningful characteristics of the data are reported 
here. 


RESULTS 

The average number of Rorschach responses for the Control and Psychogenic 
groups was essentially the same. The mean R values of 23.25 and 23.38 compare 
favorably with the mean R of 27.15 shown for normal children by Thetford, Molish 
and Beck. The mean numbers of W, D, and Dd, frequently called the approach, 
were 3.88, 16.88, and 2.50 for the Control group as compared with 2.13, 19.38 and 
1.88 for the Psychogenic group. The Wilcoxon Matched-Pairs Signed-Ranks Test 
showed no significant differences between these factors. 

The Control group exhibited a mean Z value of 14.13 which approximates the 
Thetford, Molish and Beck Z of 11.05 and is higher than the mean 4.06 Z value ob- 
tained for the Psychogenic group. A total ranks value (T) of 4.0 indicates that the 
difference between the Z values for the two groups is significant at the .05 level. 
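
The total ranks value (T) cited above is the statistic of the Wilcoxon matched-pairs signed-ranks test. As a rough illustration of how such a T is obtained for eight matched pairs, the following Python sketch may help; the paired scores are hypothetical and are not the study's data, and the function name is an assumption.

```python
def wilcoxon_t(first, second):
    """Smaller of the positive and negative rank sums for matched pairs.

    Zero differences are dropped and tied absolute differences share
    the average of the ranks they occupy.
    """
    diffs = [a - b for a, b in zip(first, second) if a != b]
    ordered = sorted(abs(d) for d in diffs)

    # Assign each distinct absolute difference its average rank.
    rank_of = {}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and ordered[j] == ordered[i]:
            j += 1
        rank_of[ordered[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j

    positive = sum(rank_of[abs(d)] for d in diffs if d > 0)
    negative = sum(rank_of[abs(d)] for d in diffs if d < 0)
    return min(positive, negative)


# Hypothetical scores for eight matched pairs (control, psychogenic).
control = [14, 12, 18, 10, 16, 15, 13, 17]
psychogenic = [4, 5, 3, 11, 6, 2, 15, 4]

t_value = wilcoxon_t(control, psychogenic)
print(t_value)
```

The obtained T is then compared with the tabled critical value for the number of pairs; a small T means that nearly all of the ranked differences point in the same direction.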

Constriction of the responses of the Psychogenic group as indicated by the low 
Z value, is further corroborated by few human movement and color responses. Three 
psychogenic subjects gave color determinants. When color was reported, it was con- 
trolled, that is, all color responses were FC for the Psychogenic group. In contrast, 
four of the hard-of-hearing control subjects utilized color with a tendency to use 
uncontrolled C, CF+ and CF— types of determinants. The Ranks Test was not 
applied since too few subjects produced color responses. However, the mean sum C 
value of 2.06 of the Control group compares favorably with the sum C norm of 2.02 
reported by Thetford, Molish and Beck for their total group of normal children. 
Both groups made infrequent though normal use of white spaces. 

Shading or Y responses were minimal for the Psychogenic group. Only one 
shading response (YV) was produced. Shading responses were utilized more ex- 
tensively in the Control group with five subjects producing either a Y, YF or FY 
response. However, only the FY+ response was given by two or more of the hard- 
of-hearing control subjects. Again, the most outstanding feature is the constriction 
shown by the Psychogenic group. In regard to the M/C ratio, two of the control 
subjects and three of the psychogenic subjects failed to produce either an M or C. 

The mean F+ percentages (55.88 for the Control group, 74.50 for the Psychogenic group) as 
well as the mean extended F+ percentages (59.75 and 76.20, respectively) are much higher 
for the Psychogenic group. The 
differences in both percentages between the two groups are significant at the .02 level 
according to the Ranks Test. The mean F+ percent for the Psychogenic group more 
closely approximates a mean F+ percent of 70.05 with a standard deviation of 11.54 
reported by Thetford, Molish and Beck. Therefore, it seems that the Psychogenic 
group is demonstrating good form perception indicative of good ego strength and 
respect for reality. In contrast, the mean F+ percent of the Control group is beyond 
one standard deviation of the expected mean. For adult norms, Beck uses the 
third standard deviation as the critical F+ percent for the healthy personality. The 
Control group F+ percent of 55.88 falls within the second standard deviation and 
may be interpreted as being in the direction of the neurotic personality. 

The constriction of the responses of the Psychogenic group is again evident in 
the high F percent of 94.75. Thetford, Molish and Beck found a mean of 79.80 for 
F percent which closely parallels the mean of 79.00 obtained for the Control group. 
A total ranks value of 1 (P = .05) indicated that the F percent obtained for the 
Psychogenic group was significantly greater than the F percent obtained for the 
Control group. 


SUMMARY 


This investigation was concerned with the personality characteristics of a 
sample of children diagnosed as psychogenically hard of hearing. They were com- 
pared with a Control group of hard-of-hearing children. The Psychogenic group 
manifested certain hysteroid type characteristics. An extended report may be ob- 
tained from the authors. 


NUDITY IN HUMAN FIGURE DRAWINGS 
GARY M. FISHER¹ 


Fairview State Hospital, Costa Mesa, California 


PROBLEM AND PROCEDURE 


Machover has suggested that those who underclothe the drawing of the 
human figure (designated as body narcissists) have “more than the average degree of 
sexual maladjustment”, and also that sexual maladjustment is indicated when the 
opposite-sex figure is drawn first. This study relates the sex of the first-drawn figure 
to the presence of nudity in the figure drawings of 1154 white male adolescents (14 to 
19 years) who were incarcerated for delinquent behavior. The great majority of Ss 
had histories of chronic delinquency, but no cases involving sexual crimes or devia- 
tions were included in the study. 

Each drawing was placed in one of the following five categories: fully clothed; 
partially nude (bathing suits, trunks, bare chest, etc.); nude with genitals; nude 
without genitals; outline of figure only (neither clothed nor nude). A test of the 
significance of the difference between the proportions was performed for 
each category between Ss who drew the male figure first and those who drew the 
female figure first. 
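
A test of the difference between two proportions of this kind is commonly computed as a critical ratio with a pooled standard error. The Python sketch below assumes that form, which the source does not spell out, and applies it to the overall figures reported in the Results: 984 of the 1154 Ss drew the male figure first, and nudity appeared in about 15% of that group against 36.5% of the remainder. The female-first N of 170 and the rounded counts are inferred from those figures, and the function name is hypothetical.

```python
import math


def critical_ratio(x1, n1, x2, n2):
    """Difference between two proportions divided by its pooled standard error."""
    p1 = x1 / n1
    p2 = x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / standard_error


total_ss = 1154
n_male_first = 984
n_female_first = total_ss - n_male_first       # inferred, not stated in the text

nudity_male_first = round(0.15 * n_male_first)        # about 15% of the group
nudity_female_first = round(0.365 * n_female_first)   # about 36.5% of the group

cr = critical_ratio(nudity_female_first, n_female_first,
                    nudity_male_first, n_male_first)
print(round(cr, 2))
```

A critical ratio is read against the normal curve, so a value well above 2.58 corresponds to significance beyond the .01 level.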


RESULTS 
The male figure was drawn first by 85.3% of the Ss, a finding which is in agree- 
ment with the results of previous studies. Table 1 presents a comparison of the 
drawings of Ss who drew the male figure first and those who drew the female figure 
first. Ss who drew the female figure first indicated nudity significantly more frequent- 
ly in both the male and female figures in all three categories of nudity (partially nude, 
nude with genitals, nude without genitals) than did Ss who drew the male figure 
first. Of the Ss who drew the female figure first, 36.5% gave some indication of 
nudity in one or both of their drawings whereas only 15% of the Ss who drew the male 
figure first did so. In addition, clothed figures were drawn significantly more fre- 
quently by Ss who drew the male figure first than by those who drew the female 
figure first, and no differences existed between the two groups with respect to the 
frequency with which outline-figures (neither clothed nor nude) were drawn. Thus, 
two drawing characteristics which purportedly reflect sexual maladjustment have 
been shown to be positively related to each other. 


TABLE 1. DIFFERENCES IN INCIDENCE OF NUDITY IN FIGURE DRAWINGS ACCORDING TO THE SEX OF 
THE FIRST-DRAWN FIGURE 

[Table body illegible in the source. Columns: Male Figure Drawn First (N = 984), N and %; Female Figure Drawn First, N and %; Critical Ratio. Rows, each given for the male figure and the female figure: fully clothed; partially nude; nude with genitals; nude without genitals; outline of figure only; and a final row for Ss indicating nudity in one or both drawings (N = 148 among Ss who drew the male figure first).] 


¹Appreciation is expressed to Connell Cowan and John Stamford for their assistance in this study. 


SUMMARY 
Indications of nudity in the human figure drawings of 1154 white male adoles- 
cent delinquents were related to the sex of the first-drawn figure. Ss who drew the 
female figure first had a significantly greater incidence of nudity (partially nude, 
nude with and without genitals) in both the male and female figures than did Ss 
who drew the male figure first. Two figure drawing characteristics purportedly re- 
flecting sexual maladjustment have been shown to be positively related to each other. 


PERFORMANCE CHANGES IN REPEATED JUDGMENTS OF 
SCHIZOPHRENIC VERBAL RESPONSES¹ 
STANLEY BLUMBERG 


University of Texas, Southwestern Medical School 


PROBLEM 

This paper extends the study of judgments of disorganization in schizophrenic 
thinking into the area of learning by investigating the effects of knowledge of results 
and of practice in the absence of external reinforcement on clinical judgments. Since 
there is some evidence from studies in the area of perceptual learning that ab- 
solute judgments by the method of single stimuli improve with practice even without 
correction, or knowledge of results, clinical judgments might also be expected to be- 
come more accurate with practice. Also, since all experiments providing a contrast 
between practice with and without knowledge of results show a clear superiority of 
the condition with knowledge, clinical judgments should improve faster under the 
latter condition. A review of a wide variety of judgmental and perceptual-motor 
studies using knowledge of results for reinforcement presents the general con- 
clusion that learning is faster and reaches a higher level with knowledge of results, 
with more specific knowledge leading to more rapid improvement than less specific 
knowledge. 

This study was designed to test the effects of three experimental conditions on 
repeated clinical judgments. The hypotheses were as follows: (a) Practice in making 
judgments, in the absence of information as to their accuracy, would nevertheless 
lead to more rapid, reliable, and valid judgments, thus indicating that learning had 
taken place. (b) Specific knowledge of the correct judgmental answers would result 
in even more rapid, reliable, and valid judgments. (c) General, or less specific, in- 
formation as to the correct answers would lead to an intermediate position, with 
respect to learning correct judgmental responses, between practice without feed- 
back information and the specific reinforcement condition. (d) These three condi- 
tions would result in transfer of training (more rapid, reliable, and valid judgments 
when new stimuli were presented to be judged). 


¹This study is part of a larger project being conducted at Northwestern University under the 
direction of Professor William A. Hunt through Contract 7onr450(11) with the Office of Naval Re- 
search. The opinions expressed are those of the author and do not represent opinions or policy of the 
Naval service. An extended version of the study was submitted to the Graduate School of North- 
western University in partial fulfillment of the requirements for the Ph.D. degree. The author grate- 
fully acknowledges the guidance and assistance of Dr. W. A. Hunt, committee chairman, and Drs. R. 
I. Watson, Janet A. Taylor, and W. F. Hill. 


METHOD 


Subjects. Three groups of judges, 30 in each group, comprised the subjects (Ss) 
in the study. All Ss were drawn from one introductory psychology class for college 
undergraduates and were assigned randomly to the groups. All Ss were naive, having 
never previously served in a judgment experiment. 


Procedure. Each S, tested individually, rated on a 7-point scale for severity of 
disorganized thinking 21 schizophrenic verbal responses to vocabulary test items 
previously standardized by Hunt and Arnhoff. These schizophrenic responses, 
selected to cover the entire stimulus range with three stimuli roughly at each of the 
seven scale points, were presented one at a time by a slide projector to which a timer 
was connected. The apparatus is described elsewhere. The series of 21 stimuli was 
repeated six times in a different random order for each presentation. Four practice 
stimuli not contained in the regular series preceded the experiment proper. 

Group I Ss received only practice in making their judgments; i.e., they were 
given no information as to the accuracy of their ratings. Group II Ss were told after 
each rating whether their judgment was correct, too high, or too low. The third 
group was given the correct scale value for each item after each judgment was made. 
After the six training trials all Ss were given a new list of 21 items, equated in scale 
values to the first list, which they rated twice without receiving any knowledge as to 
the accuracy of their judgments. 
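
The three training conditions differ only in what the judge is told after each rating. A minimal Python sketch of that rule follows; the function name, the group labels as strings, and the example ratings are assumptions made for illustration.

```python
def feedback(group, rating, standard_value):
    """Return what a judge is told after one rating on the 7-point scale.

    Group I: practice only, nothing is reported (None).
    Group II: general knowledge of results (correct / too high / too low).
    Group III: specific knowledge of results (the correct scale value).
    """
    if group == "I":
        return None
    if group == "II":
        if rating == standard_value:
            return "correct"
        return "too high" if rating > standard_value else "too low"
    if group == "III":
        return standard_value
    raise ValueError(f"unknown group: {group!r}")


# Hypothetical rating of 5 for an item whose standardized value is 3.
for group in ("I", "II", "III"):
    print(group, feedback(group, 5, 3))
```

On the two transfer trials with the new list no group receives information, which corresponds to calling the rule with group "I" for every S.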


Measures. Three measures of performance on the first six trials were obtained 
as indices of learning. These were latency, or time of judgmental response, reliability 
of the ratings as measured by interjudge agreement, and accuracy, or validity, of the 
ratings as measured by product moment correlations with the established standard- 
ized values. These same measures were used for evaluating transfer of training. 
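
The validity measure described above is a product moment correlation between one judge's ratings and the standardized scale values. The Python sketch below shows the computation for a 21-item list with three stimuli at each of the seven scale points, as in the Procedure; the judge's ratings are hypothetical, and the function name is an assumption.

```python
import math


def pearson_r(x, y):
    """Product moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Standardized values: three stimuli at each of the seven scale points.
standard = [value for value in range(1, 8) for _ in range(3)]

# Hypothetical ratings by one judge on one trial.
ratings = [1, 2, 1, 2, 3, 2, 3, 3, 4, 4, 5, 4, 5, 5, 6, 6, 6, 7, 7, 6, 7]

validity = pearson_r(ratings, standard)
print(round(validity, 3))
```

Interjudge reliability could be approached in the same way by correlating one judge's ratings with another's, although the text does not state exactly how agreement was pooled across judges.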


RESULTS AND DISCUSSION 


As indicated in Table 1, mean judgment times over the first six trials dropped 
essentially uniformly for all three groups of judges suggesting that learning occurred 
at the same rate for all three groups. Analysis of variance of the individual mean 
judgment time scores revealed an absence of differences between groups, with trials 
as the only significant variable affecting speed of judgment. Since the time scores 
on trial seven are the same as on trial one, it should be concluded that the three con- 
ditions used in the original training situation had no differential effects on the amount 
of time taken to judge new stimuli. 

TABLE 1. MEAN JUDGMENT TIMES IN SECONDS, MEAN RELIABILITY COEFFICIENTS AND MEAN 
VALIDITY COEFFICIENTS OVER TRIALS 

[Table body largely illegible in the source. Rows: Groups I, II, and III under each of the three measures; columns: trials. Surviving entries, not assignable to trials: reliability, I .805, II .791, III .852; validity, I .776 and .769, II .697 and .781, III .730 and .852.] 

Reliability of the ratings, measured by interjudge agreement, increased over the 
six training trials, as indicated in Table 1. As predicted, Group III achieved greater 
agreement by the sixth trial than did Group II, which in turn showed more agree- 
ment than did Group I. Analysis of variance of the Z transformations of the in- 
dividual reliability coefficients indicated significant differences between groups and 
trials, and interaction between trials and groups. This suggests that specific knowl- 
edge of results leads to higher reliability of judgments than does less specific knowl- 
edge which in turn results in higher reliability than does no information (practice 
alone). 
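
The Z transformation applied to the reliability coefficients before the analysis of variance is presumably Fisher's r-to-z transformation, which makes correlation coefficients more nearly normal in distribution. Assuming that reading, a short Python sketch is given below; the function names are hypothetical, and the coefficient used is simply one of the Group I values quoted in the text.

```python
import math


def fisher_z(r):
    """Fisher's r-to-z transformation: z = atanh(r)."""
    return math.atanh(r)


def inverse_fisher_z(z):
    """Convert a z value back to a correlation coefficient."""
    return math.tanh(z)


r_trial_one = 0.808  # Group I reliability on trial one, as quoted in the text
z_trial_one = fisher_z(r_trial_one)

print(round(z_trial_one, 3), round(inverse_fisher_z(z_trial_one), 3))
```

Means and variances are computed on the z values, and a mean z can be converted back to a correlation with the inverse function when a summary coefficient is wanted.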

In spite of some increase in interjudge agreement over trials for Group I the 
difference between their reliability on trial three (.831) and on trial one (.808) is not 
statistically significant (t = 1.15). This leads to the conclusion that in the absence of 
some kind of information as to the accuracy of their judgments, Ss’ judgments do 
not become significantly more reliable with practice at judging the same stimuli. A 
conclusion that these Ss are not learning anything, however, is not warranted by 
these results. What is meant by learning must be specified, and the most likely ex- 
planation relating the time score data to the reliability data is that all three groups 
did, in fact, learn but that Group I learned something different from what Groups 
II and III learned. Groups II and III learned the standardized values for the 
stimuli. Ss in Group I learned their own responses to the specific stimuli presented 
over repeated trials, which accounts for their more rapid response times over trials. 
This result replicates that found in the Hunt and Blumberg study of repeated
judgments. 

A comparison of the mean reliability coefficients on trial seven with those on 
trial one reveals improved performance (positive transfer of training) for both re- 
inforced groups but not for Group I. Group II improved in reliability from .741 to 
.798 (t = 1.97, p < .05), while Group III went from .768 to .824 (t = 3.06, p <
.005). This result suggests that, in the absence of information as to the accuracy of 
their judgments, practice in making clinical judgments of this kind does not lead to 
improvement (higher reliability) in judging new but similar material. On the other 
hand, such improvement does appear to occur following practice with such informa- 
tion, with the greatest improvement resulting from the specific knowledge of results 
training condition. 

The results with respect to validity, the correlation of each S’s judgments with 
the standardized values of the stimuli, resemble those obtained using reliability as 
the measure of performance. Group III showed the most rapid learning, or improve- 
ment in terms of the validity of their ratings, over the six training trials, followed 
by Group II, followed by Group I as shown in Table 1. Analysis of variance showed 
significant differences between groups and trials. Again, there was no significant im- 
provement in Group I’s ratings. 

Apparently, while each S in Group I is learning his own responses to the specific 
stimuli, and thus becoming more consistent with himself, he is not becoming more 
consistent with his own group of judges (reliability in this study) nor with the group
of experienced clinicians upon whom the items were standardized (validity in this 
study). This is in accord with the finding of Seashore and Bavelas, reported in 
Ammons’ review, that repeated practice in the absence of any knowledge of results 
may result in an increasing self-consistency although no improvement in performance
in terms of accuracy with the experimenter’s standard occurs. Thus the hypothesis,
based on some of Gibson’s findings, that Group I Ss would improve in their
reliability and validity of judgments in the absence of knowledge of results was not
confirmed. The hypothesis, or assumption, was that during the first few trials Ss
would learn more about the range of stimuli and correct prior judgments which they 
felt to be in error as the context became better established. 

As for transfer of training, using mean validity coefficients as a measure of per- 
formance, we find Group II obtaining .697 on trial one and .751 on trial seven, a 
difference which is not significant at the .05 level (t = 1.57, p < .10). Group III,
however, improved from .730 to .800, a difference which is significant (t = 3.24,
p < .005). Again, Group I showed no improvement, and in no case was performance
on trial eight superior to that on trial two. Apparently, the specific reinforcement
led to the greatest amount of learning (learning the “correct” scale values for
the first list of items) and also resulted in the highest performance, as measured by 
both reliability and validity, when Ss were presented with the new list of stimuli on 
trial seven. 

In conclusion, the results support the hypothesis that practice at rating “clinical
stimuli” of this type leads to more rapid judgments upon repeated presentations,
but does not lead to more reliable or valid judgments of either the same or new
stimuli. On the other hand, practice with a general knowledge of the relative accuracy
of one’s judgment results in more reliable and valid ratings of the same material,
but this may not transfer to new stimuli. Practice with specific knowledge of
the correct rating apparently results in more reliable and valid judgments of not 
only the same stimuli, but also highly similar new stimuli. 


SUMMARY 


Three groups of naive judges, 30 to a group, rated 21 schizophrenic vocabulary 
test responses on a 7-point scale for disorganization of thinking. The judgments 
were repeated over six trials. One group received only practice in making their judgments;
i.e., they were given no information about the accuracy of their ratings. The
second group of subjects was told after each rating whether their judgment was
correct, too high, or too low. The third group was given the correct scale value for 
each item after each judgment was made. After the six training trials all subjects 
were given a new list of 21 items, equated in scale values to the first list, which they 
rated twice without receiving any knowledge as to the accuracy of their judgments. 

The results indicated that practice, without information as to the accuracy of 
judgments, did not lead to improvement, i.e., higher reliability and validity, in
clinical judgments; but that both specific and less specific knowledge of the correct 
ratings did. Only the training condition involving specific knowledge resulted in 
transfer of training, i.e., statistically significant improvement in both reliability and
validity of judgments of new material following original training.


BELOW CHANCE PERFORMANCE ON A VERBAL TEST USING
PAIRED CONCEPTS 


E. FRANCESCO 
Box 601, Ithaca, N. Y. 


INTRODUCTION 

In the design of tests that have dichotomous response alternatives, the usual 
procedure is to balance the test so that it has as many items with one alternative 
correct as the other. That is, in a test with true-false type answers, there are gener- 
ally as many items with correct true as correct false answers. The logic underlying 
this design is that when the respondent is not answering on the basis of his knowl- 
edge, he should be getting no more than half the answers correct on the basis of 
guessing, or by “chance.” The design is assumed to be valid for any respondent
strategy of answering if the items are presented in random order. Guessing all true, 
all false, different or equal proportions of true or false, at random or by any system, 
should not improve the score of the respondent. There is an additional assumption, 
however, that must actually be satisfied for this to be true, namely, that the test also 
should be balanced with regard to the difficulties of the items associated with the 
two alternatives. 

It is well to illustrate a strategy that would beat the system if the alternatives 
are not balanced with regard to difficulty. Suppose that in a hypothetical test the 
items that have correct answers that are true are easier than those that are false. A 
respondent might take the test answering all the items he is quite sure about first. 
Suppose that he finds that of 100 items he has answered 25 true and 5 false. Then, if
he assumes that there is a balanced number of true and false items in the test, he can
simply answer all remaining items as false and get a better score than by guessing
each individually. By the former strategy he would get a score of 75 correct, by the
latter only 65 correct. Of course, it is the choice of strategy itself that involves un- 
certainty for the respondent at this point. It might be easier to reason that the test 
is biased in the direction of inclusion of more true items than false ones, since the 
number of true ones the respondent recognized far exceeded the number of false 
ones. With this strategy, i.e., by answering all remaining items as true, the respond- 
ent would get a score of 50, which means that his strategy actually counteracted any 
advantage that he would have achieved on the basis of his actual knowledge. The
“safe” strategy is the one of guessing items individually, which presumably in the
absence of knowledge is a random set of responses. 
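
The arithmetic behind the three strategies can be sketched in Python as follows. The sketch assumes, as the example does, a test of 100 items split evenly between true and false, and it further assumes that all 30 of the confidently answered items are in fact correct; the function and key names are illustrative only. Under this accounting the strategy of marking all remaining items true comes out at 55 rather than the 50 quoted above, but the qualitative point is unchanged: it does worse than guessing the remaining items individually.

```python
def strategy_scores(n_items=100, sure_true=25, sure_false=5):
    """Expected scores for three ways of answering the items left over
    after the confidently known ones (assumed all correct)."""
    half = n_items // 2                      # balanced test: 50 true, 50 false
    remaining_true = half - sure_true        # true items still unanswered
    remaining_false = half - sure_false      # false items still unanswered
    known = sure_true + sure_false           # items already answered correctly
    remaining = remaining_true + remaining_false
    return {
        "all_remaining_false": known + remaining_false,
        "guess_individually": known + remaining / 2,   # expected value
        "all_remaining_true": known + remaining_true,
    }

for name, score in strategy_scores().items():
    print(name, score)
```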

The parallel situation occurs when tests are unbalanced in the number of items, 
assuming the difficulties of the items associated with the two alternatives are equiva- 
lent. In this case, the best strategy for a test, once the proportion of true and false 
answers is assessed, would be to assign all remaining answers to the response altern- 
ative with the larger proportion of known correct answers. 


ILLUSTRATIVE MATERIALS 

In the development of the Verbal Power Test of Concept Equivalents (VPT), 
these considerations of difficulty entered the design prominently. The objective of 
balance in number and in difficulty did not appear possible on the basis of actual 
experience. The instructions for the VPT define the task as the recognition of con- 
cept pairs that are the Same (e.g., neat-tidy) or Different (e.g., foot-shoe). As has 
been previously reported, the task of recognition of Different pairs was judged
easier than that of recognition of Same pairs on the empirical basis of respondent 
performance. Thus, the VPT forms were designed to each have 63 Same items and 
42 Different items; the logic simply was to provide for a balanced appearance if one 
is answering many of the Different items correctly and has insufficient knowledge 
to answer the Same items and must resort to guessing or to choosing a test strategy
for answering. The design of the test maintained the same ratio of Same and Different
in a revision, but the ratio was increased to 46 to 29 when the number of items
for each of the forms was reduced in accord with experience on the test, and both
forms of the test were placed on a single IBM response form.¹ The experience discussed
below refers to the 75 items that compose the A form of the test.

Interest in the question of below chance performance arose out of a sample of 
respondents in which a substantial proportion of VPTs were completed at or below 
the manifest chance level, when this was assumed to be half of the dichotomous 
responses answered correctly.² In this sample about 40 per cent of respondents
(Table 1) had 39 or fewer items correct out of the possible 75. About 10 per cent had 
34 or fewer correct. The question that came to the fore was the obvious one. Since 
the VPT forms had been devised on “average” subjects, and this group reasonably
fits the description,³ what were the characteristics of responses at or below the manifest
chance level?


TABLE 1. MEAN Same AND Different CORRECT RESPONSES BY TOTAL
NUMBER CORRECT


Total                       Mean         Mean
number       Number of      Same         Different
correct      cases          correct      correct

25-29             3
30-34            20
35-39            54
40-44            46
45-49            29
50-54            19
55-59
60-64
65-69

The data from Table 1 are plotted in Figure 1. It is seen that while the mean
number of Different items that would be expected to be answered correct at the 
chance level is 14.5, the mean number correct does not go below 17.2. The mean 
number of Same items that would be expected to be answered correct would be 23,
but the mean number actually gotten correct goes well below this, and in fact per- 
sons with from 40-44 correct in the total score are performing with a mean number 
Same correct just slightly under 23. Because of the apparent fixed minimum number 
of Different items correct, for persons operating below the manifest chance level on 
the total scores the ratio of correct Same-to-Different is inverted relative to the ratio 
of available items. 


Figure 1. Mean Same and Different Correct Responses by Total Number Correct


¹Length of the test was set primarily on considerations of improvement of correlation to external
measures. In the current length of forms (75 items each) improvement is not marked by doubling the
length of the test. For example, in a sample of high performing housewives in the Broome County
Extension Service leader training program (N = 143), the correlations of forms A and B to the Quick
Word Test were .77 and .80 respectively, while the combined forms correlated at .83. The between
forms correlation was .80 for this restricted variance group. A supply of the new IBM forms of the
VPT is available for research in studies of adult intelligence by writing the author (Box 601, Ithaca,
N. Y.).

²These data were provided through the generous cooperation of Dr. James H. Panton, Supervisor,
Reception Center, State of North Carolina Prison Department.

³This group had a mean total IQ on the WAIS of 99.0, but the SD was very small (10.4); the
population SD for the WAIS is given as 15, indicating the highly restricted variance in this sample.

⁴Many of the comments on strategy arise, of course, from interviewing respondents at various
times. There are limitations to this kind of procedure, however, best described as the intrusion of the
respondent’s concern with the interpretation of the concepts, or “why I got that one wrong.”


DISCUSSION

We may now look to some of the possible interpretations of the findings in
terms of the strategies of response.⁴ What does the task of recognizing Same or
Different concepts actually involve? First, there are two clear conditions of accurate
response when both concepts are known. Second, there is the possibility that only
one is known or both are not known, and then it would be expected that some
strategy of response would operate. If neither is known, there is no basis for judgment,
and thus any guess might be appropriate. If one is known but not the other,
this might well provide the situation for an assumption that it is a mismatch, since
the compared concept doesn’t mean to the respondent what the known concept
means. The evidence, however, seems to point simply to a tendency to answer
Different when one does not know that they are the same. Thus, decrease in total 
scores on the VPT appears to be consistently accompanied by an increase of in- 
correct or Different responses on the Same items. If scores even lower than those 
found were to occur, extending the observed trend we would expect them to occur 
because more Same items were being responded to as Different. Further, if the trend 
were correctly assessed, the tendency should be for more Different items to get correctly
answered as an artifact of the increased lack of knowledge. In Figure 1 an
increase in the number of Different items answered correctly is not noticeable, but 
it should be recognized that, for example, in the group having 30-34 total number 
correct a mean score of 15.3 Same items correct means that 30.7 are incorrect, or 
about 67 per cent of the Same items are answered as Different. Only 17.2 or 59 per- 
cent of the Different items are answered as Different. In other words, at this level 
respondents are answering Different items as Different less frequently than they do 
the Same items, and although only three cases are involved, this is even more ac- 
centuated for the group having 25-29 total number correct. 

One other way to examine the concomitants of lack-of-knowledge as a variable 
is to select the respondents on the basis of another criterion. Here we may look at 
the low end of the distribution on the WAIS, and we find that for the nine cases with 
full scale scores under 80, the total mean number correct was 32.5, and these were 
divided into 12.1 Same and 20.4 Different correct. About 74 per cent of the Same 
items are being answered as Different while only about 70 percent of the Different 
items are being answered in this manner. The Different items have, so to speak, a 
resistance to being seen as Different that is greater than that of the Same items. 
Presumably, however, if persons were chosen who had insufficient knowledge, Same 
and Different items would be responded to identically. 
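
The chance expectations and the percentages quoted in this discussion can be checked with a short Python sketch. It assumes form A as described (46 Same and 29 Different items) and assumes that every item is answered, so that a Same item scored incorrect was answered Different; the names used are illustrative only.

```python
N_SAME, N_DIFFERENT = 46, 29   # composition of form A as given in the text

def chance_expectations():
    """Expected numbers correct (Same, Different, total) under random guessing."""
    return N_SAME / 2, N_DIFFERENT / 2, (N_SAME + N_DIFFERENT) / 2

def pct_answered_different(mean_same_correct, mean_diff_correct):
    """Per cent of Same items and of Different items answered Different,
    assuming no omitted items."""
    same_as_diff = (N_SAME - mean_same_correct) / N_SAME * 100
    diff_as_diff = mean_diff_correct / N_DIFFERENT * 100
    return same_as_diff, diff_as_diff

print(chance_expectations())
# Group with 30-34 total correct, and the nine cases with WAIS scores under 80.
print([round(p) for p in pct_answered_different(15.3, 17.2)])
print([round(p) for p in pct_answered_different(12.1, 20.4)])
```

With the means given in the text, the rounded percentages agree with the figures of about 67 and 59 per cent for the 30-34 group and about 74 and 70 per cent for the low WAIS cases.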

Here, however, the essential point is clear. The test is operating with interpretable
characteristics below its manifest chance level. Some limitations need to be
pointed out. A person who operates on a random basis will get a better score (mean
37.5) than persons who have some knowledge and operate on the kinds of bases in- 
dicated, namely to assume generally that when only one concept is known or both 
are not known, they are Different. On the other hand, dichotomizing persons getting 
scores of 37 and 38 (manifest chance level) by the ratio of Same to Different correct 
responses did not indicate any corresponding difference on WAIS scores.

It should be noted that the proportion of Different scores seems to have an upper 
limit, possibly at about 75 per cent. (No case of all Different responses has been en- 
countered yet.) It may appear unreasonable to respondents as a general strategy of 
test taking to believe that all the items would have responses of Different, and thus a 
counter strategy may require a certain amount of guessing that items should be 
scored as Same. According to informal interviews, this seems to be a secondary con- 
sideration, ostensibly placing a limit on the tendency to respond to items individ- 
ually as Different. This same kind of a consideration prevents the test from being 
made up entirely of Same items. If all were Same items, there probably would be a 
point where persons who know the paired concepts would give their attention to 
over-all strategy instead of responding to the items. For example, persons knowing 
most of the items might be inclined to favor the Different response in order to keep a 
reasonable proportion of both kinds of answers, while others might guess the compo- 
sition of the test and score them all as Same. 


SUMMARY 
Characteristics of response below the manifest level of chance on a verbal test 
(VPT) are examined and are indicated to be meaningful. Considerations of strategies
of response to individual items and to the test as a whole are discussed.


SOCIAL DESIRABILITY AND THE TYPE OF HOSTILITY
EXPRESSED ON THE TAT


MARVIN REZNIKOFF AND ADELAIDE DOLLIN


Institute of Living, Hartford, Conn. 


In a recent investigation, Allison and Hunt studied the relationship between
aggression and the Edwards social desirability (SD) variable. Positing that verbal
aggression is generally seen as undesirable in our culture, they tested the proposition 
that high SD people as measured by the Edwards Scale would be likely to express 
less aggression than low SD people. Utilizing a specially devised paper-and-pencil
“Situational Frustration Test”, these investigators obtained a significant inverse
association between amount of aggression elicited on this test and independent SD 
ratings. They concluded that their results could be taken to indicate construct 
validity of the Edwards SD concept. 

While Allison and Hunt focussed entirely on quantity of aggression and SD, the 
present study examined amount and another factor of potentially equal importance 
—the type or kind of aggression in relation to SD. In this regard it could be con- 
jectured, for example, that individuals with a strong SD orientation might be basic- 
ally quite hostile but consistently manifest their hostility in a less direct manner. 
Specifically, then, expressions of overt and covert hostility were compared in a group 
of subjects categorized according to their scores on the Edwards SD Scale.


PROCEDURE


Forty-eight psychiatric inpatients falling into a variety of non-organic psycho- 
diagnostic categories participated in this study. All were of average intelligence or 
better and had at least a year of high school. An Edwards SD Scale and six TAT 
cards were administered to each. The six TAT pictures were selected to sample a
range of emotional tone, two being rather happy or pleasant (2 and 10), two unhappy
or sad (3 BM and 13 MF), and the remaining two (1 and 4) intermediate in their
connotation.

Each of the six TAT stories collected from the forty-eight subjects was eval- 
uated for hostile content by two judges working individually and utilizing a hostility 
scale developed by Hafner and Kaplan. This latter procedure, which was modified
slightly for the current study, enumerates and classifies TAT themes as indicative of 
either overt or covert hostility and assigns weights to them on a four point scale. A 
reliability coefficient of .90 was obtained between the independent ratings of the two 
judges, employing the Hafner and Kaplan criteria of aggression. 


RESULTS AND DISCUSSION 


On the basis of their Edwards SD scores the subjects were divided into four 
equal groups. The ratings of the two judges were pooled and two TAT cards each 
combined under the three major categories of happy, neutral and sad. A double 
classification analysis of variance design with replicates was then used to compare 
the quantities of hostility found in four SD groups, as revealed in their responses to 
the three pairs of TAT cards. 

As can be seen from Table 1 the only significant differences obtained were in
comparisons of the TAT card categories. The four SD groups did not differ in 
amounts of overt, covert or total hostility expressed. Neither were there any sig- 
nificant interactions between the TAT stimulus values and the SD groups. 


TABLE 1. ANALYSES OF VARIANCE OF HOSTILITY RATINGS


Analysis            Source of Variation





Overt hostility Groups P .85 
TAT : 56.7 .83 
Groups X TAT 24 
Error 

Covert hostility Groups ‘ , .50 
TAT : 50.0: 54 
Groups ) . 74 
Error 

Total hostility Groups ‘ 2.11 n.s. 
TAT , 202.76 001 
Groups X TAT 6 4.98 n.s. 
Error 132 4.89 





The second and more important analysis for purposes of the paper involved
evaluation of type of hostility manifested. Accordingly a sign test was performed,
ranking overt and covert hostility with respect to each other for each subject in the
two extreme SD groups. The data deriving from this procedure are reported in
Table 2. It can be seen that in the low SD group five of twelve patients expressed
more covert than overt hostility with three expressing equal proportions of each, a
finding strictly explainable on the basis of chance. This stands in contrast to the
performance of the high SD group, in which nine of twelve subjects exhibited more
covert than overt hostility, a result that departs significantly from chance expectations.


TABLE 2. SIGN TEST APPLIED TO OVERT AND COVERT HOSTILITY IN EXTREME
SD GROUPS (O = OVERT; C = COVERT)


                  Distribution of Subjects
Group          O>C      O=C      C>O      Total

Low SD

High SD
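
For readers unfamiliar with the sign test, the computation for the low SD group can be sketched in Python as follows. The sketch assumes the conventional treatment in which tied subjects (O = C) are dropped and the remaining signs are referred to a binomial distribution with p = .5; whether the original analysis handled ties this way, or was one-tailed, is not stated in the text. The function name is illustrative only.

```python
from math import comb

def sign_test_p(n_plus, n_minus):
    """One-tailed exact probability of observing at least n_plus positive
    signs among the non-tied pairs when each sign has probability .5."""
    n = n_plus + n_minus
    return sum(comb(n, k) for k in range(n_plus, n + 1)) / 2 ** n

# Low SD group, from the text: 12 subjects, 5 with C > O, 3 with O = C,
# hence 12 - 5 - 3 = 4 with O > C once ties are set aside.
print(sign_test_p(5, 4))
```

For the low SD group this probability is .5, in keeping with a result explainable by chance. The corresponding figure for the high SD group depends on how the three subjects who did not show more covert hostility divide between O > C and O = C, which the text does not state.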

Certain of the current study’s results are not in agreement with those reported 
by Allison and Hunt, which indicated that subjects receiving high scores on the Edwards
SD Scale express significantly less aggression than subjects with low SD ratings. The 
fact that the earlier investigation used college undergraduates and student nurses 
while the present population was composed of psychiatric patients may partially 
account for the disparity. 

Most likely, however, the findings are at variance with each other primarily be- 
cause of the nature of the two tests employed to elicit aggression. The frustration 
test developed by Allison and Hunt was a minimally disguised, comparatively direct 
measure of aggression utilizing multiple choice responses. Such a procedure would 
doubtless tend to sharpen existing disparities between SD groups. The TAT, by 
contrast, a projective technique involving relatively ambiguous stimulus material
not explicitly designed to evoke aggressive feelings, might, in effect, be undercutting
the operation of an SD set governing hostile expression. On the other hand, the find- 
ing that high SD subjects are significantly more covert than open in their aggression 
while low SD individuals are not, would seem to be in line with general SD theory. 
This aspect of the data would also emphasize the importance of recognizing type as 
well as quantity of aggression as a variable associated with SD orientation. 


SUMMARY 

The Edwards Social Desirability (SD) Scale and a six card TAT were adminis- 
tered individually to each of forty-eight psychiatric inpatients with the principal ob- 
ject of determining whether the kind of aggression expressed, covert or overt, was 
related to SD ratings. Based on their SD scores, the subjects were separated into 
four equal groups and their TAT evaluated by two judges for quantity and type of 
hostility utilizing a procedure developed by Hafner and Kaplan specifically for this 
purpose. Analyses of variance revealed no significant differences among the four 
groups with regard to amounts of covert, overt and total hostility. A sign test, how- 
ever, comparing overt with covert aggression for each subject in the two extreme 
groups, indicated that a significantly greater number of individuals than chance in 
the high SD group were rated as having more covert hostility, while in the low SD group
no consistent direction was evident. The findings were contrasted with an earlier 
study of aggression and SD, and briefly discussed.


A LONGITUDINAL COMPARISON OF INTELLIGENCE INVOLVING
THE WECHSLER-BELLEVUE I AND WAIS


PROBLEM AND PROCEDURE


In an intelligence testing course in 1950, the WB I was given under supervision 
to 28 unselected subjects ranging in age from 18 to 72 and varying widely in educa- 
tional and occupational backgrounds. These same subjects were located in different 
parts of Florida, North Carolina, and Alabama and retested with the WAIS in 1960. 
In effect, this ten year interval made possible a longitudinal study. Admittedly 
the N is small and the group is varied, as no replication design had been planned. 
However, since longitudinal studies involving the same subjects tested with WB I 
and WAIS are rare, it is interesting to note the results for whatever contribution 
they might make to the problems of IQ constancy, decline of intelligence with age, 
and comparisons of two well known tests. 


RESULTS 

In Table 1 the 1950 group tested with WB I, ranging in age from 18 to 72 years 
with an average age of 30.57, is compared with the same group tested with WAIS 
ten years later in 1960 when the age range was 28 to 82 and the average age was 
40.57. The group was composed of 16 males and 12 females; no significant sex difference
between mean IQ’s was found. The range of IQ’s on WB I for 1950 was 105 to
133 and for WAIS in 1960, 97 to 137.

Table 1 shows that the group means for both tests are high, indicating that the 
group is mostly composed of bright to superior people. Even though this is a rather 
selective group, the IQ’s remained reasonably constant over the ten year period. 


TABLE 1. COMPARISON OF WB-I AND WAIS, FULL SCALE, VERBAL SCALE, PERFORMANCE SCALE AND
SUBTEST SCALED SCORES (N = 28).


Subtests          WB I 1950          WAIS 1960
                  Mean     SD        Mean     SD        P        r





Full Scale 118.57 8 .: 119.29 8.81 y 82 





Verbal Scale ] 
Information 
Comprehension 


17.71 20.86 9.32 3. 83 

1 . & 

1 
Arithmetic l 

l 

1 

l 


HY : 3.79 2.06 

2! 2.26 4.50 2.57 2% 35 
64 2.83 58 

Similarities 89 2.41 : ) 5 41 

Digit Span 

Vocabulary 


0.46 2.67 1.11 3.53 2 69 
3.81 2.34 3.8 72 


Performance Scale 116.18 9.88 5.18 54 § 45 
Digit Symbol 11.9% 2.67 00 3.00 2.7 81 
Pict. Comp 11.5 3.1} 96 2.18 ) 38 
Block Design 13.8 2.54 2.25 3.21 62 
Pict. Arr. 10 .96 2.7 00 2.80 02 18 
Obj. Assem 11.36 3.21 10.50 2.43 ; 10 .49 








Although the results obtained are in some degree comparable to a few previous 
studies of WB I and WAIS such as those of Cole and Weleba, Goolishian and
Ramsay, and Dana, they are not directly parallel because the earlier studies
were not longitudinal in character. 

Subtest score comparisons were made and are presented in Table 1. Digit 
Symbol, Vocabulary and Digit Span have the highest correlations. Both Vocabulary 
and Digit Symbol also showed significant differences between means. The significant
changes between means of some of the subtest scores are probably attributable
to differences in standardizing procedures between WB I and WAIS.


CONCLUSIONS 
A group of 28 unselected subjects was tested in 1950 with the Wechsler Bellevue 
I and retested in 1960 with the WAIS. The IQ’s remained relatively constant for the 
group over a ten year period. This finding would appear to attest to the validity of 
the age correlation factors and the readjustment of WAIS scores formulated by 
Wechsler.


PSYCHOLOGICAL EFFECTS OF ISOCARBOXAZID AND NIALAMIDE
ON A GROUP OF DEPRESSED PATIENTS


VASSO VASSILIOU AND H. E. HIMWICH


Thudichum Psychiatric Research Laboratory 
Galesburg (Ill.) State Research Hospital 


PROBLEM

The present investigation compared the effects of isocarboxazid* (Marplan, Hoffman-LaRoche)
and nialamide* (Niamid, Pfizer), both monoamine oxidase inhibitors, on a group of
hospitalized depressed patients in their performance on a selected battery of tests.
Although the effects of these drugs are not known, it was suggested that the drugs have
the property of reducing depressive symptoms. However, it was thought interesting to
determine the effects of the drugs on other areas of functioning on an exploratory basis.
Tests involving a variety of areas, such as ideational processes, psychomotor
coordination, and mood were included in the test battery.


METHOD

Tests. The main author had no knowledge of the suggested effects of the drugs and
consequently a variety of tests were selected to determine what the effects of these
drugs were. Eleven tests were chosen to assess changes in ideational processes,
psychomotor coordination and mood. Four tests involving the Vocabulary, Digit Span,
five problems from the Arithmetic subtest, and Digit Symbol were chosen from the
Wechsler Adult Intelligence Scale. Other tests included: a Visual Design Memory test;
an Immediate, Intermediate and Past Memory test; a Visual Object Memory test; and
selected items from Sacks Sentence Completion test.

Scoring. All tests, except the Sentence Completion test, were of an objective nature
and did not present quantitative scoring problems. The
Sentence Completion data were rated on eight
bipolar scales with seven intervals in an attempt to differentiate “Drive Orientation”
and “Personal Assessment” under the three experimental conditions. The scales under
“Drive Orientation” were: strong-weak, positive-negative, appropriate-inappropriate;
those under “Personal Assessment” were: positive-negative, happy-sad, active-passive,
satisfied-unsatisfied, acceptable-unacceptable. The same scales were used for each
of three testings. Three psychologists rated the Sentence Completion test
independently on the eight bipolar scales. To determine the degree of agreement among
the three sets of independent scores the Kendall coefficient of concordance was
used. Reliability coefficients secured for “Drive Orientation” were: .82, .83, .88
respectively and for “Personal Assessment” .80, .87, .85. Any disagreement was
resolved through conference.


*We are grateful to Hoffman-LaRoche, Inc., Nutley, N. J., and to Pfizer Laboratories of
Groton, Connecticut for the generous supplies of isocarboxazid and nialamide respectively.


Subjects and Procedure. Thirty hospitalized
patients, 16 females and 14 males, selected
on the judgment of three psychiatrists on the
basis of depressive symptoms were the subjects
(Ss). The mean age of the females was 48.64
(range 20 to 68) and the mean age for the males
was 50 (range 34 to 72). Initially all Ss were able
to participate in the testing and work on the
tasks assigned to them with the exception of two
males and two females who refused to take the 
tests. Thus four Ss were eliminated from this 
study. The Ss were originally tested under condi- 
tion one (C1), after being placed on Placebo for 
eight weeks without administration of the drug. 
The second testing involved condition 2 (C2) 
following the administration of the drug eight 
weeks later. The male patients were on isocar- 
boxazid (Marplan) starting on 10 mg T.I.D. and 
the dosage was gradually increased until the 
patients responded, but not higher than a maxi- 
mum of 20 mg T.I.D. The female patients were 
on nialamide (Niamid). They started on 25 mg 
T.I.D. and went to a maximum of 75 mg T.I.D. 
Both drugs are monoamine oxidase inhibitors. 
Differences in dosage were due to the fact that 
isocarboxazid is more potent than nialamide. 
Dosages were arranged in such a way in order to 
secure equal clinical potency. Following that the 
Ss were seen and tested for a third time after a 
period of eight weeks during which all Ss were 
placed on placebo again and this was called condi- 
tion 3 (C3). Neither the Ss nor the main investi- 
gator were aware of the actual medication until 
the completion of the study. 

The 14 female Ss and 12 male Ss were combined 
and treated as a single sample in the analysis of 
the data because no apparent differentiations 
between the two groups in their scores on the 
test material were observed. Comparison be- 
tween the male and female Ss by the Mann- 
Whitney nonparametric technique produced no 
significant differences. 


RESULTS 

With the exception of the Sentence Completion 
test, each test from the battery was analyzed in- 
dividually under the three conditions specified in 
the procedure. Because of the apparent skewness 
of the scores, the results were analyzed by the 
nonparametric Friedman two-way analysis of 
variance by ranks to determine whether the 
rank totals differ significantly; the results are pre- 
sented in Table 1. 

Table 1 reveals that all subtests were signifi- 
cant beyond the .01 level. In every case the score 
for C1 was the largest, followed by the score in 
C3. C2, when the drug was introduced, revealed the 
lowest score. Lower scores in all subtests indicate 
a poorer performance. 
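
The Friedman two-way analysis of variance by ranks used here can be sketched as follows. Each subject's scores are ranked across the k conditions, the ranks are totalled per condition, and the statistic is 12 / (Nk(k+1)) times the sum of squared rank totals, minus 3N(k+1). The function names and the scores are hypothetical and are not the study's data; no tie correction is applied to the statistic itself.

```python
def rank_row(values):
    """Rank one subject's scores (1 = lowest); tied scores share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of sorted positions i..j, as 1-based ranks
        for pos in range(i, j + 1):
            ranks[order[pos]] = avg
        i = j + 1
    return ranks


def friedman_chi_square(scores):
    """Friedman statistic for N subjects (rows) measured under k conditions (columns)."""
    n = len(scores)
    k = len(scores[0])
    rank_totals = [0.0] * k
    for row in scores:
        for j, r in enumerate(rank_row(row)):
            rank_totals[j] += r
    return 12 / (n * k * (k + 1)) * sum(t ** 2 for t in rank_totals) - 3 * n * (k + 1)


# Four hypothetical Ss, columns C1, C2, C3 (illustrative values only).
scores = [
    [30, 20, 25],
    [28, 19, 24],
    [35, 22, 30],
    [27, 18, 21],
]
chi2 = friedman_chi_square(scores)
print(chi2)
```

With k = 3 conditions the statistic is referred to the chi-square distribution with 2 degrees of freedom, for which the .01 critical value is 9.21; with very small N, exact tables are used instead.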




TABLE 1. MEAN SCORES AND THE FRIEDMAN 
TWO-WAY ANALYSIS OF VARIANCE ON THE TEN 
DIFFERENT SUBTESTS UNDER THE THREE 
EXPERIMENTAL CONDITIONS 


Test C1 C2 C3 


Vocabulary 29.5 20.34 
Immediate Memory 
Intermediate Memory 
Past Memory 
Visual Object Memory 
Visual Design 
Digit Span Forward 
Digit Span Backwards 34.23 
Arithmetic 27.30 
Digit Symbol 212.30 


8.19 5.38 
24.23 18.84 
18.84 11.92 
59.61 50. 
36.92 
48.46 


*Significant beyond the .01 level 


From the Sentence Completion test, although 
reliabilities were high, results were not analyzed 
for discussion because inspection of the test data 
did not reveal any changes in the areas of “Drive 
Orientation” and “Personal Assessment”. 


SUMMARY 


The purpose of this study was to explore the 
effects of isocarboxazid (Marplan) and nialamide 
(Niamid), both monoamine oxidase inhibitors, on 
depressed patients. Eleven tests were chosen to 
assess changes in ideational processes, psycho- 
motor coordination and mood. Patients were 
tested under three experimental conditions. It 
was found that isocarboxazid and nialamide 
lowered the performance of the Ss in the tests in- 
volving functions referring to higher mental pro- 
cesses and psychomotor coordination. 
A CROSS-VALIDATION OF LEARY’S 
“PUBLIC” COMMUNICATION LEVEL 


WALTER G. KLOPFER 


University of Portland 


PROBLEM 

Leary (1) has specified operational measures of five levels of interpersonal be- 
havior. These include the Level of Public Communication (the way in which an 
individual is perceived by people in general), the Level of Conscious Perception (the 
way in which he perceives himself), the Level of Private Symbolization (the way he 
is evaluated on the basis of projective tests), the Level of the Unexpressed Uncon- 
scious (significant omissions at the preceding levels), and the Level of Values and 
Goals (perception of the ideal person). Of all these, the first, or “Public” level, has 
been the most generally ignored in typical personality assessment programs, due 
largely to the difficulty of attacking it directly in a cross-sectional evaluation of an 
individual. Nevertheless, it would seem to be a vital part of any comprehensive 
method of evaluating interpersonal behavior since it will lead directly to predictions 
of social stimulus value, which in turn will enable the diagnostician to evaluate the 
tendency on the part of the individual being assessed to provoke various kinds of re- 
actions on the part of other people. Thus, a person who at the “Public” level demon- 
strates dominance is likely to provoke submission; if he demonstrates aggression he 
is likely to provoke rebellion; if he demonstrates docility he is likely to provoke 
responsible, managing behavior, etc. 

Leary’s method of predicting such “Public” behavior is to use formulae de- 
rived from various scales of the MMPI. The two main dimensions of his inter- 
personal system are Dominance-Submission and Love-Hate, and the two together 
form coordinates on which a summary point can be plotted. These summary points 
lead to more specific predictions of habitual interpersonal behavior as it is perceived 
by people in general. The question being attacked in this study is whether the pre- 
dictions based upon the MMPI can be cross-validated by using an independent pre- 
dictor of “Public” interpersonal behavior along the two dimensions specified in 
Leary’s system. 


PROCEDURE 

Thirty-one outpatients seen in a university clinic were given a complete battery 
of psychological tests, including the MMPI, and an interview. After evaluating all 
the material, with the exception of the MMPI, the psychologist was asked to fill out 
a copy of Leary’s Interpersonal Adjective Check List on the patient in terms of how 
he is likely to be perceived by people in general. Thus, the clinician is asked to give 
a prediction of Public Level behavior of the patient, based upon a multi-level assess- 
ment procedure. The Adjective Check List was scored according to Leary’s stand- 
ard system, and measures of Dominance and Love on each individual thus be- 
came available. The MMPI scales were then analyzed and predictions made ac- 
cording to Leary’s usual system on the same two dimensions. The group of patients 
was then paired in terms of the Love and Dominance dimensions as to how con- 
sistently their “Public” behavior was being predicted on the MMPI and the IACL. 


RESULTS 


The Pearson product moment correlation of Dominance as measured by the 
MMPI and IACL was .29, which is not significantly different from chance. A similar 
correlation of Love, however, yielded a correlation of .54, significant at the .01 level. 
It would be of major value if a standard psychological instrument like the MMPI, 
which is self-administering and scorable clerically, could indeed produce valid pre- 
dictions of “Public” behavior. 
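
The significance statements above can be checked with the usual t transformation of a Pearson correlation, t = r * sqrt(n - 2) / sqrt(1 - r^2), on n - 2 degrees of freedom. The sketch below assumes two-tailed tests with n = 31 and takes the critical values for 29 degrees of freedom from a standard t table; the function name is hypothetical.

```python
import math


def t_for_r(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 degrees of freedom."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


N = 31            # outpatients in the study
T_CRIT_05 = 2.045  # two-tailed .05 critical value, 29 df (standard t table)
T_CRIT_01 = 2.756  # two-tailed .01 critical value, 29 df (standard t table)

t_dominance = t_for_r(0.29, N)
t_love = t_for_r(0.54, N)

print(round(t_dominance, 2), t_dominance > T_CRIT_05)
print(round(t_love, 2), t_love > T_CRIT_01)
```

Under these assumptions the Dominance correlation falls short of the .05 criterion while the Love correlation exceeds the .01 criterion, in agreement with the reported results.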
SUMMARY 


This study has tended to validate Leary’s system of predicting “Public” be- 
havior from MMPI formulae in part, in that Affection and Hostility are apparently 
predictable validly from the MMPI as judged by the criterion herein employed. 
However, the Dominance dimension fared less well. It is difficult to know from the 
results of the present study whether this is because the Dominance dimension lends 
itself less easily to prediction by clinicians on the basis of a standard clinical exper- 
ience, or whether indeed Leary’s system is weak in part. Further research is in- 
dicated to refine further the degree to which the MMPI can be used to predict 
“Public” behavior in the manner suggested by Leary. This might take the form of 
deriving and cross-validating new empirical scales of Love and Dominance from the 
MMPI on the basis of Ss judged clinically on these dimensions. 


REFERENCES 
1. Leary, T. The Interpersonal Diagnosis of Personality. New York: The Ronald Press Co., 1957. 


CHANGES IN ESSENTIAL HYPERTENSION OVER TIME: A FACTOR 
ANALYTIC CASE STUDY* 


G. A. MILTON 


University of Colorado 


Early in the literature on hypertension, Alexander demonstrated a relation 
between the “emotional state” of an hypertensive patient and changes in his blood 
pressure over time. Specific emotional states, behaviors, and personality 
characteristics have also been attributed to the hypertensive patient as differentiated 
from patients with other psychosomatic disorders and non-patient populations 
(p. 292; 2, p. 746; 3). The most often noted characteristics are repressed hostility and 
conflict about sex-role identification. To the extent that these and other emotional 
states are important in the elevations in blood pressure of an hypertensive it is to be 
expected that correlations would be obtained between fluctuations in blood pressure 
and the specific variables. The present study investigates the relationships over 
time among blood pressure, personality dynamics discussed during therapy, and 
overt behaviors for a single hypertensive patient. 


THE PATIENT 

The patient was a 38-year-old white male hypertensive who was still in the 
fluctuating stage of hypertension. His blood pressures ranged between 120/70 and 
140/200 during the nine-month period in which he was followed. He was hospital- 
ized in a GM and S hospital during this period and was receiving biweekly psycho- 
therapy. Although he had a mixed psychiatric diagnosis with both characterological 
and obsessive-compulsive features, he displayed a classic hypertensive syndrome 
(from Dunbar), including problems with hostility, authority, especially where 
women were concerned, lack of full utilization of his intelligence, problems of im- 
potence, and an interest in playing golf. 


*The author is indebted to Dr. Jack Blumenkrantz and Mrs. Dorothy Frey Marshall for their 
assistance on this project. 
PROCEDURES 

Four groups of variables are used in this analysis: one measure of blood pressure, 
nine measures based on content analyses of psychotherapy interviews, four measures 
based on records of the patient’s behavior on the ward, and one measure of the 
amount of drugs being taken by the patient. The inter-correlations among these 15 
variables as measured over the nine-month period were then factor analyzed by the 
centroid method, and the resulting centroid factors were rotated orthogonally until 
what appeared to approximate simple structure was achieved. The patterns which 
emerged are discussed in the results section. 
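
The first step of the centroid method mentioned above can be sketched as follows. Each variable's loading on the first centroid factor is its column sum in the correlation matrix divided by the square root of the sum of all entries, and the residual matrix is formed by subtracting the products of loadings. The 3 x 3 matrix, the unit diagonal (in place of communality estimates), and the function names are assumptions for illustration only. Later factors require sign reflection of the residuals before the step is repeated, and rotation is not shown.

```python
import math


def first_centroid_factor(r):
    """First centroid loadings: column sums divided by sqrt of the grand sum."""
    n = len(r)
    col_sums = [sum(r[i][j] for i in range(n)) for j in range(n)]
    total = sum(col_sums)
    return [c / math.sqrt(total) for c in col_sums]


def residual_matrix(r, loadings):
    """Correlations left after removing the variance reproduced by one factor."""
    n = len(r)
    return [[r[i][j] - loadings[i] * loadings[j] for j in range(n)] for i in range(n)]


# Hypothetical correlation matrix with unit diagonal (illustrative values only).
R = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
]
loadings = first_centroid_factor(R)
residuals = residual_matrix(R, loadings)
print([round(a, 3) for a in loadings])
```

By construction the entries of the residual matrix sum to zero, which is why the residuals must be reflected before a second centroid can be extracted.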

The patient’s blood pressure was measured daily by the medical staff over the 
period of this study, but the psychotherapist avoided any knowledge of this variable 
in order to prevent any methodological contamination between this variable and the 
variables measured in the therapy sessions. Mean blood pressure, computed as 
(2 × Diastolic + Systolic) / 3, was used as the score in this analysis. 
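
As a worked instance of the mean blood pressure score, the sketch below applies the weighting of two parts diastolic to one part systolic to the 120/70 reading quoted earlier for this patient. The function name is hypothetical.

```python
def mean_blood_pressure(systolic, diastolic):
    """Mean blood pressure score: (2 * diastolic + systolic) / 3."""
    return (2 * diastolic + systolic) / 3


# The lower reading reported for the patient, 120/70.
score = mean_blood_pressure(120, 70)
print(round(score, 2))
```

Weighting the diastolic value twice yields a single daily score per reading, which is what allows blood pressure to enter the correlation matrix as one variable.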


The patient was seen in 75 biweekly therapy sessions over the nine-month 
period. The therapist did not introduce topics of discussion. During and after each 
therapy hour the therapist made detailed descriptive notes of the session which were 
subsequently rated by two judges for the presence of the following variables. (The 
patient had vigorously objected to the use of a tape recorder.) The inter-rater re- 
liabilities for each variable are presented after the title of the variable. 

2. Hostility (.88). Any discussion of the patient’s feelings of anger or hostile acts. 
3. Identification (.70). Discussion of the patient’s conflict about role identity, activity-passivity, 
what he wants to be. 
4. Sex (.78). Discussion of sexual behavior and impulses. 
5. Manipulation (.81). Discussion of the patient’s attempts at controlling others or his en- 
vironment. 
6. Physical Comfort (.88). Discussion of health, physical well-being, comfort. 
7. Authority (.67). Discussion of authority problems, rules, superior-peer relations. 
8. Family Origins (.93). Discussion of childhood, home and family. 
9. Dreams (.99). The patient relates a dream and works with it. 
10. Patient-Therapist Relations (.77). Discussion of the therapy interaction and feelings about it. 


Notes about the patient’s behavior on the ward by the nursing staff as they ap- 
peared in the chart were recorded for the following variables: 


11. Tension, when the patient was noted as being tense and upset. 
12. Hostile behavior, when the patient displayed such behavior. 
13. Complaints, when the patient complained about service, conditions, etc. 
14. Acting out, when the patient broke rules, tested limits, and failed to keep appointments. 


The last variable included in the analysis is the amount of drugs being given 
the patient by the ward physician. This is included primarily as a control variable 
since the dosage ranged rather widely over the nine-month period. 


RESULTS 
The inter-correlations among the 15 variables are presented in Table 1. In- 
spection of the matrix suggests several clusters of variables within this patient. For 
example, we find that when the patient’s blood pressure was high he was more 
inclined to discuss dreams and identification problems, he was more tense and less 
hostile on the ward, and he was receiving smaller drug dosages. These patterns be- 
come clear when the results of the factor analysis are examined. Five factors emerged 
from the analysis and the rotated factor matrix is presented in Table 2. All loadings 
of .35 and above have been marked with an asterisk to indicate that they probably 
have a stable relationship to the factor. All 5 factors seem to be meaningfully defined. 


TABLE 1. INTERCORRELATIONS AMONG THE 15 VARIABLES 

TABLE 2. ROTATED FACTOR MATRIX: HYPERTENSIVE PATIENT FACTORS 


Variable 

1. Mean Blood Pressure 
2. Hostility 
3. Identification 
4. Sex 
5. Manipulation 
6. Physical Comfort 
7. Authority 
8. Family Origins 
9. Dreams 
10. Patient-Therapist Relations 
11. Tension 
12. Hostile Behavior 
13. Complaints 
14. Acting Out 
15. Drugs 


*Significant loading. 


INTERPRETATION OF THE FACTORS 


Factor A seems to be a classic hypertensive pattern as it has emerged in this 
individual. This seems to be an emotional state factor. 
Variable Loading 
13. Complaints .76 
14. Acting Out .75 
9. Dreams .55 
12. Hostile Behavior .41 
1. Blood Pressure .40 
11. Tension .36 


This pattern is of considerable significance for psychosomatic theory. In this patient 
we find complaining, rule breaking, hostile, tense behavior varying over time with 
fluctuations in blood pressure. However, we do not find that his discussion of hostil- 
ity is associated with this pattern. In fact, during these periods of time, he seems less 
inclined to discuss hostility (-.22) which would be quite consistent with the theoret- 
ical position that the repression of hostile ideation is essential to the elevation of 
blood pressure in a hypertensive. 
Factor B may best be described as an interpersonal relations factor. 


Variable Loading 
Patient-Therapist Relations .71 
Manipulation .70 
Hostility .63 
Physical Comfort .42 
Sex .50 


All of the variables loaded on this factor are from the therapy material and sug- 
gest a consistent pattern of associations in the patient’s thought. In discussing 
interpersonal relations he tends also to be concerned with his own well-being and 
hostility, but does not associate this with sexuality. Sex, for this patient, has a 
separate and distinct meaning as will be seen in factor D. 


Factor C points out quite clearly the significance of the identification conflict in 
the blood pressure fluctuations of this patient. 


Variable Loading 
1. Blood Pressure 
3. Identification 
12. Hostile behavior 
15. Drugs 


This factor is also of importance for psychosomatic theory. The fluctuations in this 
patient’s blood pressure seem associated to a marked degree with his concern about 
identification. It will also be noted that during periods of elevated blood pressure and 
concern about identification he is not showing manifest hostility. That the amount 
of drugs being taken by the patient is also negatively loaded in this pattern suggests 
that the drugs did, in fact, help control his blood pressure, but that they also had the 
effect of reducing his motivation to talk about this central conflict area of identi- 
fication. 


Factor D seems simply to illustrate a sex-authority linkage in this patient. 


Variable Loading 
Authority .81 
Sex .43 


Authority problems are associated with sexuality and the sexual act for this patient. 
The content of these discussions frequently involved frustration and impotence 
associated with both spheres. 


Factor E is a pattern of less clarity than the others but may indicate a reflective 
introspective attitude on the part of the patient. 


Variable Loading 
9. Dreams .82 
6. Physical comfort .70 
8. Family origins .56 
14. Acting out -.42 


That this pattern emerges suggests that the patient had periods of time when he was 
actively working on the problems of his well-being, seeking their origin in his early 
experience. During these periods he appears to have been ‘‘well behaved” in the 
sense of conforming to the rules of his hospital society. 


DISCUSSION 
Of the five factors which emerged in this analysis at least two are of consider- 
able significance for psychosomatic theory. For this patient, fluctuation in blood 
pressure over time is clearly related to patterns involving hostility and identification. 
This is supportive of the inference drawn from previous research which indicates 
that hypertensive patients, as a group, have greater problems in these areas than do 
other patient groups or non-patient groups. However, for this patient, the 
emerging pattern was not one combining hostility and identification, but rather two 
separate patterns. One (Factor A) is a pattern of emotional disturbance and active 
overt acting out of the hostility without any evidence of awareness of the significance 
of this behavior. The other is a consideration of problems of identity, accompanied 
by, in fact, lowered manifest hostility. This clearly supports Alexander’s assertion 
that in the hypertensive the fluctuations in blood pressure are a direct result of 
fluctuations in the hostile impulses (p. 2). Also this indicates that the identifica- 
tion problems which Kalis, Harris, et al. emphasize in connection with the hyper- 
tensive and which Alexander sees as a “characteristic psychodynamic structure” 
(p. 2) have a direct and fluctuating covariance with elevations in the patient’s 
blood pressure. 

Of the three other patterns which emerged at least one, the sex-authority factor 
(D), is also referred to in the literature. This factor, however, is not related 
to the fluctuation in blood pressure and, for this patient at least, must be regarded 
as an independent aspect of his personality. There is also evidence that this patient 
linked hostility and manipulation with interpersonal relations (Factor B) and that 
he had definite periods of seeking an answer in his early experiences (Factor E) but 
these patterns, also, did not covary with his blood pressure. 


SUMMARY 


This study is a factor analysis of 15 variables measured over time for a single 
hypertensive patient. The hypothesis investigated is that emotional and behavioral 
variables which are significant in fluctuations of the patient’s blood pressure will 
emerge from this analysis. Five factors were isolated, two of which were loaded with 
the blood pressure measure. These two seem significant for psychosomatic theory 
since one of the factors is related to overt hostility and the other to problems of 
identification. 
EDITORIAL OPINION 





THE FUTURE OF APA DIVISION 12 


The Winter 1961 Newsletter of the Division of Clinical Psychology contains a letter 
by Victor Raimy suggesting a reorganization of Division 12 to include within it 
an “AMERICAN SOCIETY OF CLINICAL PSYCHOLOGY” with membership open to 
Ph.D.’s professionally qualified to practice clinical psychology. The raison d’être 
of the new ASCP would be to provide the mechanism for a strong social action group 
in our rapidly coming-of-age profession which is increasingly concerned with its own 
impetus and enormous potential as a profession. 

There is some question whether simply forming an ASCP within Division 12 
consisting of the same persons who now dominate Division 12 membership would 
solve the issues which have resulted in dissatisfaction with Div. 12 in its present 
composition. There is a great danger of creating an elite guard within clinical psycho- 
logy comparable to the GROUP FOR THE ADVANCEMENT OF PSYCHIATRY (GAP) 
which created such dissension in the AMERICAN PSYCHIATRIC ASSOCIATION a few 
years ago when it constituted itself as a self-appointed action group to guide psy- 
chiatry. We see no purpose in duplicating an elite group whose membership and 
functions would be overlapping with Div. 12 in important respects. 

The real split within Div. 12 exists between the academic professors who have 
dominated the Division for years and the practitioners in the field who want a truly 
professional association to represent them. This phenomenon is not limited to clin- 
ical psychology but has existed in other clinical professions where the needs of basic 
science professors and practicing clinicians have differed greatly. In clinical psy- 
chology there has never been a strong association of professional clinicians who 
derive a major portion of their income from actual practice. For many reasons, there 
has been a tendency of academicians to look down upon full time clinicians and upon 
those in private practice. Conversely, there has been some tendency among practi- 
tioners to resent academicians. In the history of the Division of Abnormal and 
Clinical Psychology, there has been a tendency for the academicians to be interested 
primarily in abnormal psychology, and for the practitioners to be more interested in 
the professional promotional activities of the division. There is serious question 
whether the two interest groups belong together and whether their joint interests 
would be better served by having two separate professional organizations. 

In our opinion, Division 12 might better be left in its present form as a special 
interest group within the AMERICAN PSYCHOLOGICAL ASSOCIATION serving the same 
functions as the specialty sections of the AMERICAN MEDICAL ASSOCIATION. We be- 
lieve that the proposed AMERICAN SOCIETY OF CLINICAL PSYCHOLOGY should be 
organized independently of the APA and should represent primarily the interest of 
practicing clinicians as their social action group. The ASCP would thus function in 
the same independent capacity as does the AMERICAN PSYCHIATRIC ASSOCIATION, 
having its own organization and functions. 

Personally, we are unwilling to see any substantial increase in the dues of 
Div. 12 simply to create another bureaucracy which does not solve the basic issues at 
stake. For Div. 12 to set up its own executive secretary and offices would be the 
first step in duplicating functions which properly belong to the APA and which 
should not come into competition with the APA. There is some question as to what 
faction in the present Div. 12 such an executive secretary and office would represent. 
Certainly, there is no reason to believe that it would fairly represent the practicing 
clinicians who are clamoring for representation. And here, we would like to make a 
suggestion whereby the practicing clinicians could be assured better representation 
in Div. 12 business. It is simply the suggestion that academicians and practitioners 
should alternate in the Presidency and important appointments within the Division 
so that all groups might have representation. 

We would like to secure opinions from Division 12 members concerning the 
feasibility of these suggestions and would be glad to have an informal discussion of 
these alternatives at the coming 1961 Business Meeting of Division 12. 